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Abstract 



This thesis contains results on different questions in quantum information. It con- 
sists of four chapters. The subject of the first chapter is the copying of quantum 
states by stimulated emission. According to the no-cloning theorem by Wootters 
and Zurek[92] it is fundamentally impossible to build a machine which would be able 
to produce an exact copy of a quantum system in an unknown state. The impossi- 
bility of perfect copying follows immediately from the linearity of quantum physics. 
Approximate copying however is compatible with the principles of quantum mechan- 
ics. Quantum mechanics only gives bounds on the fidelity of the copies. Stimulated 
emission, which is at the heart of the laser, is a natural candidate for the practi- 
cal realization of a quantum copier. Here it is shown that optimal (i.e. saturating 
the quantum mechanical bounds) copying of photons can be realized by stimulated 
emission in simple quantum optical systems, for example three-level atoms. The 
fidelity of the copies is limited by the unavoidable presence of spontaneous emission, 
which thus assures that the quantum mechanical bounds are obeyed. 

In spite of its non-local features such as the violation of Bell's inequalities, quantum 
physics is entirely compatible with the special theory of relativity. In particular, 
entangled states cannot be used for superluminal communication. This peaceful co- 
existence has led to the question whether the impossibility of superluminal signaling 
could be used as an axiom in deriving basic features of quantum mechanics from 
fundamental principles. In the second chapter we show that this is indeed the case. 
If the kinematical features of quantum physics including the projection postulate are 
assumed to be given, then its dynamical rules can be derived with the help of the no- 
signaling condition. This also puts constraints on possible non-linear modifications 
of quantum mechanics. 

Quantum mechanics usually only allows statistical predictions for the behavior of in- 
dividual physical systems. The third and fourth chapter of this thesis arc devoted to 
theorems on the existence of hidden variables which would make it possible to make 
predictions for individual systems. One of the classical hidden-variable theorems is 
the one by Kochen and Specker, which states that so called non-contextual hidden 
variables are incompatible with quantum mechanics. A new, much simplified, ver- 
sion of this theorem is given, which leads to a proposal for a simple experimental 
test of non- contextual hidden variables, for example with single photons and linear 
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optical elements. 

The fourth chapter treats the derivation of hidden- variable theorems for real exper- 
iments, in particular for finite measurement precision. This investigation was mo- 
tivated by recent claims that the Kochen-Specker theorem loses its validity under 
such conditions. It is shown that the basic statements of hidden-variable theorems 
are robust under real-world conditions. 
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Preface 



Let me begin with some remarks on how I ended up doing precisely the things 
which are collected in this thesis, and not something else. The emphasis of my 
undergraduate studies in Vienna and then also in Paris was on theoretical particle 
physics. I had chosen this subject at the beginning of my studies because it seemed 
the most fundamental area of physics. 

But already as an undergraduate I became very fascinated by the mysterious features 
of quantum mechanics. I remember that I first heard about Bell's inequahties from 
Robin Michaels, then a mathematics student in Cambridge, shortly after beginning 
my studies. I already knew the basic principles of quantum mechanics at that 
time, but I had had a naive realistic view concerning its statistical predictions, as I 
realized through our discussion. Later, I attended the seminar on the Foundations of 
Quantum Mechanics organized by Reinhold Bertlmann and Anton Zeilinger, which 
was my first contact with Anton. 

Towards the end of my studies I reahzed that I wanted to learn more about the 
fundamental questions of quantum mechanics, and, if possible, work on them. I 
thought that the most fundamental question was whether there is something be- 
yond quantum mechanics, or whether we have to content ourselves with its highly 
idiosyncratic ways of giving us information about the world. I was aware that I 
would most probably not be able to answer this question during my PhD, but it 
certainly was the guiding star of my decision. I was also aware that the question is an 
experimental one. Up to this point, I had not learned very much about experiments. 

Given all this, it was quite natural that I joined Anton's group in Innsbruck in De- 
cember 1997. It is worth mentioning that I arrived there more or less simultaneously 
with the first TV crews wanting to know about teleportation. I was determined to 
learn as much as possible about the experimental side. I was lucky enough to spend 
the first year working as an apprentice on the up to now best experimental test of 
Bell's inequalities, Gregor Weihs' PhD experiment. Although my time as an ex- 
perimentalist was actually not very long, it was a very valuable experience. I am 
convinced that it made me a better physicist, also in theory. 

I had always intended to do some theoretical work on the side. Our move from 
Innsbruck to Vienna created some additional spare time. Both my work on cloning 
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and on a Kochen-Specker experiment was started in 1998, while we were still in 
Innsbruck. Anton proposed a Kochen-Spcckcr experiment as a possible topic for 
my thesis shortly after my arrival in Innsbruck, and he also suggested the work of 
Cabello and Garcia- Alcaine as a starting point. I still remember the first discussions 
with Marek Zukowski, Harald Weinfurter and Anton quite vividly. We finished this 
project much later, in March 2000. 

My interest in cloning was triggered at a European Quantum Information meeting 
in Helsinki. Gregor told me that in Innsbruck they had been discussing the relation 

of cloning and stimulated emission before, which started our common work on this 
subject. 1 remember that from the beginning Anton was interested in the relation 
between cloning and supcrluminal communication. The year after that we continued 
(and in a sense completed) our work together with Julia Kempe. 

During my undergraduate studies I had not heard much about the new field of 
quantum information. I remember reading an introduction to quantum computing 
(by Adriano Barenco) in Paris, which I had downloaded from the quant-ph folder. 
In Innsbruck, somewhat unexpectedly, I found myself in one of the centers of the 
new field. There was not only our group but also our theoretical and experimental 
colleagues, with Peter ZoUer, Ignacio Cirac and Rainer Blatt, so there was a lot to 
learn. 

Today, when asked what 1 do, 1 often call myself a theorist in quantum information, 
and I will soon be a postdoc in a "centre for quantum computation" . I have remained 
true to my foundational interests, as illustrated by my work with Caslav Brukner and 
Anton on hidden- variable theorems for real experiments, and my work with Vladimir 
Buzek and Nicolas Gisin on the no-signaling condition. It was of course very helpful 
to have a boss like Anton, who is himself so deeply fascinated by quantum physics. 
I am curious to see which unexpected turns the future will bring. Meanwhile I 
hope that the products of my efforts collected here will be of interest to some of my 
colleagues. 
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1.1 Introduction — Quantum Information 

The first part of this thesis is concerned with quantum cloning, in particular with the 
realization of quantum cloning using stimulated emission. As our work on this topic 
was usually published under the heading of quantum information in the respective 
journals it seems appropriate to start with a few remarks on quantum information 
in general [12]. 

What is quantum information? Let me try to give two tentative definitions, both 
formulated as questions. The first one may seem rather too broad, the second 
one rather too narrow. The first runs as follows: what can you do with quantum 
mechanics that you cannot do classically? This describes the spirit of the field 
rather well, but it doesn't quite explain the name quantum information. The second 
runs: what happens if one has qubits instead of classical bits? As a definition 
of quantum information this is certainly too restricted, but some of its proudest 
achievements, such as the celebrated quantum computing algorithms, fit very well 
into this framework. Furthermore, this definition clearly emphasizes the information 
processing aspect of the field. 

Nowadays, everybody knows what a bit is. Physically a bit is represented by a 
system with two possible states which are clearly distinguishable, conventionally 
denoted as and 1. There are of course many possible physical implementations, 
from smoke signals over pulses of voltage to zones of magnetization on a hard disk. 
A common feature of all these implementations is that the system can always be 
determined to be in one of the two relevant states, or 1. If this is not possible, we 
are dealing with a bad implementation. Now consider a quantum system, which can 
also be in two clearly distinguishable, i.e. orthogonal states, |0) and Then it 
follows from the basic principles of quantum mechanics that all properly normalized 
superposition states 

a|0) + /?|l), with |a|2 + |/3|2 = l (l.l) 
are also possible physical states of the system. In the following we will denote such 
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two-dimensional systems as qubits. 

The distinction between qubits and classical bits becomes even more pronounced 
when several systems are considered. Two classical bits can be in 4 different states: 
00, 01, 10 and 11. For qubits any linear combination of the corresponding basis 
states, |00), |01), |10), and |11), corresponds to a possible state of the physical 
system. This means that the two qubits can also be in entangled states, such as 



Entangled states have been known for a long time to exhibit phenomena which arc 
entirely incompatible with the world-view of classical physics, such as the violation 
of Bell's inequahties [96, 7] . It may therefore not seem too surprising that replacing 
bits by qubits can lead to rather different new results. 

The most important theoretical developments in the field of quantum information 
have been the discovery of quantum cryptography and of the quantum computing 
algorithms by Shor [83] and Grover [48]. Quantum cryptography [11] establishes 
an entirely secure communication channel between two distant parties. It is based 
on the fact that in quantum mechanics in general there is no way of performing a 
measurement without disturbing the state of the system. This implies that in an 
appropriately designed scheme, any eavesdropper trying to listen in will always be 
detected. There is no parallel to quantum cryptography in the realm of classical 
physics. In the past few years there have been many quantum cryptography exper- 
iments of increasing practicality and sophistication [54, 71, 87]. Of all the quantum 
information paradigms, quantum cryptography has certainly come closest to being 
a usable technology. 

The first milestone in the field of quantum computation, i.e. computation based 
on qubits instead of classical bits, was Shor's discovery of his factoring algorithm 
[83]. Shor showed that on a quantum computer it is possible to factor large numbers 
by performing a number of operations that scales only polynomially with the size 
of the input, whereas the fastest known classical algorithms require an exponential 
number of steps. This was the first serious indication that in the long run quantum 
computers may be dramatically faster than classical computers. Since then a lot 
of work has been invested in trying to find other relevant algorithms which show a 
similar exponential speedup. So far these attempts have not been successful. How- 
ever, Grover [48] succeeded in showing that quantum computers also out-perform 
classical computers in solving a very common task, namely in searching a completely 
unordered database - imagine for example that you are given a phone number and 
telephone directory and you want to find out which name the number belongs to. 
Although the speedup is less dramatic in this case than in the case of factoring. 




(1.2) 
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Grover's discovery may well turn out to be of great practical importance. It should 
be noted, however, that it requires the database to be given in the form of a quantum 
mechanical superposition state. 

As the existence of entangled states is one of the most important differences between 
quantum and classical mechanics, it is not too surprising that in both these quantum 
algorithms entanglement plays a decisive role. 

As for the practical implementation of quantum computing, there is still a long 
way to go. However, simple algorithms using a small number of qubits have been 
realized in various physical systems, such as in nuclear magnetic resonance [32], 
cavity QED [88, 49] and ion traps [26, 69]. It is fair to say that nobody currently 
knows whether it will be possible to build a large-scale quantum computer working 
with hundreds or thousands of qubits. The main practical difficulty is decoherence 
[98] . Under normal circumstances, a multi-qubit entangled state would immediately 
be destroyed by the interaction with its environment. It is virtually impossible to 
shield a quantum computer from its environment to such a degree as to prevent this 
from happening. However, there is hope, based on the development of quantum error 
correction [81, 25, 86] and fault tolerant quantum computation [82]. Using methods 
that are similar to classical error-correction techniques, but much more subtle, it 
is possible to detect and correct errors occurring during the quantum computation, 
including those caused by decoherence. However, these schemes already presume the 
existence of quantum computer elements which work quite reliably, that is, where 
the error probabilities are below certain threshold values, typically far below the 
percent level (per operation). 

Let us now come back to the differences between classical bits and qubits. One very 
important difference is the following: The state of an unknown bit is very easy to 
determine because it can only be or 1. On the other hand it is completely impos- 
sible to determine the state of an unknown qubit if one is given only a single copy 
of the system. This is easy to see: the unknown qubit could be in any superposition 
state a|0) + and it turns out that the best thing that the observer can do is to 
perform a projective measurement in some basis of the Hilbert space spanned by the 
states |0) and |1) [62]. Such a measurement only gives him one bit of information 
while he would need an infinity of bits to exactly determine a and (5. Interestingly, 
it is possible to teleport an unknown qubit to a distant location with the help of 
quantum entanglement [10, 13]. This can be done without finding out anything 
about the qubit's state. 

A related distinction between classical bits and qubits is the impossibility of copying 
the latter. It is very easy to copy a classical bit: even if it is originally unknown, 
one simply has to determine its state, or 1, and then produce one more bit in the 
desired state. Our discussion above indicates that such an approach cannot work for 
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qubits. Actually there is no way of constructing a perfect quantum copying machine, 
that is, a machine which given a qubit in an unknown state produces a copy. This 
is the content of the famous quantum no-cloning theorem [92, 37]. In the following 
we will see that the impossibility of copying quantum information has deep roots: 
it is related to the linearity of quantum mechanics, which is in turn related to the 
impossibility of superluminal communication. 



To our knowledge, the discussion about the cloning of quantum systems started with 
a paper by Herbert [50], where he proposed a method for superluminal communica- 
tion. His scheme made use of pairs of entangled particles shared by the two parties 
that would like to communicate (Alice and Bob), and of what he called idealized 
laser tubes, which would today be called universal cloning machines. The basic idea 
of his proposal was the following. Alice and Bob each have one member of a pair of 
entangled particles, e.g. photons described by the state 



where V and H denote vertical and horizontal polarization respectively. Alice can 
measure the polarization of her particle either in the basis \V), \H) or in the basis 
|P), |M), where \P) = -^{\V) + \H)) and |M) = -j^{\V) - \H)). If Alice measures 
in the V/H basis and finds \V) {\H)), Bob's photon is reduced to \H) (|V")), while 
if she measures in the P/M basis (results \P) and |M)) Bob's photon is reduced 
to the corresponding states in that basis (|M) and |P) respectively). Although the 
states on his side are therefore different depending on Alice's choice of basis, a priori 
this does not allow Bob to know her choice because he cannot discriminate \ V) from 
\P) by a single measurement (having only a single copy). But imagine that he has 
a machine that can produce an arbitrary number of copies of any one-photon state, 
or at least of the states \V) and \P). This would allow Bob to discriminate the 
two states and in this way Alice's two choices of basis. If Bob's copier works fast 
enough, this establishes a superluminal communication channel. Herbert proposed 
stimulated emission as a possible working principle for his copying machine. He was 
aware of the fact that spontaneous emission could be a problem, but thought that 
it would not be fatal for the scheme. 

On the other hand, it is possible to show in a general way that superluminal signaling 
(or, more precisely, signaling between systems whose operator algebras commute) is 
not possible in quantum mechanics [40]. One way of explaining why Alice cannot 
signal to Bob is the following. Bob does not know which result Alice got, therefore he 
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(1.3) 
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has to trace over her degrees of freedom. Thus his photon is described by a density 
matrix ^{\V){V\ + \H){H\), if she measured in the V/H basis, and |(|P)(P| + 
\M){M\), if she measured in the P/M basis. Of course, these two density matrices 
are identical, so there is no way for him to teU what she did. Now it is clear that 
no quantum-mechanical device can lead to a distinction between identical density 
matrices. So something has to be wrong with Herbert's argumentation. Cf. also 
our discussion in the second chapter of this thesis. 

Wootters and Zurek [92] and Dieks [37] showed that the problematic part of Her- 
bert's proposal is the copying procedure. To see this consider a device which pro- 
duces perfect copies of V and H polarized photons, i.e. which performs the following 
unitary transformation: 

\V)\^ho} ^ \V)\V)\i;v) 

\H)\^o) ^ \H)\H)\^h) (1.4) 



How does such a device act on a photon which is linearly polarized under 45 degrees, 
i.e. described by a state vector \P) — ■^{\V) + \H))7 Prom a perfect copier we would 
expect an output of the form \P)\P)\ijjp). On the other hand it follows from Eq. 
(1.4) that 

i=(|y) + \Hmo) - ^ (l^)l^)|V'y) + \H)\H)\i;H)) (1.5) 

It is not hard to see that this output will never be of the desired form. If lipv) 7^ li^n), 
the state of the two copies will even be mixed, which is certainly not what we want. If 
I V'y ) — I i^n) , the state of the two copies will be given hy ^ {\V)\V) + \H)\H)) which 
is different from the desired state \P)\P) ^ 1{\V) + \H)){\V) + \H)). This shows that 
a perfect doner of photons in the V/H Basis is a maximally bad doner for states 
in the complementary basis. Perfect cloning of general input states is impossible. 
This is the famous no-cloning theorem. It was also pointed out by Mandel [61] and 
Milonni and Hardies [68], that perfect cloning in stimulated-emission schemes such 
as proposed by Herbert is prevented by the unavoidable presence of spontaneous 
emission. This will be discussed in detail in the sequel. With all these results the 
discussion on cloning was closed for about fourteen years. 

In 1996 Buzek and Hillery [21] considered cloning from a different point of view. 
They proposed and studied an approximate copying machine, i.e. a device which, 
given a qubit in an unknown input state, produces two approximate copies. They 
also demanded that their machine be universal, i.e. the quality of the copies should 
be the same for all inputs. Buzek and Hillery showed that such a universal quantum 
copying machine can be described by the following unitary transformation: 

|0„)|0,)|0e) ^ j^|0„)|0;.)|le) + j^(|Oa)|U) + | la) lOfc)) |0c) 
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|la)|06)|0c) ^ Xl^\la)\h)\0c) + \llm\h) + |la)|06))|lc) (1-6) 

where a is the system to be copied, b is an auxihary qubit which becomes a copy 
of a through the transformation, and c is another auxihary qubit. Several remarks 
are in order. The final state of Eq. (1.6) is invariant under the exchange of a and 
b, i.e. the copying machine produces two copies with exactly the same properties. 
In this respect, the distinction between the original qubit and its copy is completely 
lost during the copying procedure. The quality of the copies can be quantified by 
the fidelity with respect to the input state, i.e. by F = {ip\pa\ip), where lip) is the 
state of the original qubit and pa = T^hcPabc is the reduced density matrix of the 
qubit a in the final state Pabc- Applying this formula to Eq. (1.6) gives F = |. One 
can show that the transformation (1.6) is indeed universal, i.e. any arbitrary input 
state q;|0) +/3|1) is copied with the same fidelity. In their seminal paper Buzek and 
Hillery did not show that their copying machine is optimal. 

The Buzek-Hillery construction was generalized by Gisin and Massar [44] , who found 
a cloning transformation producing M copies starting from N identical qubits. It is 
described by 

M-N 

Un,m\N^)= E a,-|(M-j>,jV^^)®i?,-(^), 




{M-N)\{M-j)\ 



M + l\^ {M-N-j)\M\ 



where \N%1)) is the input state consisting of N qubits all in the state we have 
denoted |(M — j)^/', jY'"'") symmetric and normalized state with M — j qubits in 
the state and j qubits in the orthogonal state ■0-'-. Rj{ip) are orthogonal states 
of the ancillary qubits which can be written as Rj{'4>) — \{M — 1 — 3)'4>* , j{'4>*)'^) , 
where ■0* is the complex conjugate of ■0. 

The transformation (1.7) looks rather intimidating, or at least unintuitive. We will 
see that actually it arises quite naturally in the context of stimulated emission for 
appropriately designed systems. The form of the coefficients Uj was also already 
explained in a rather intuitive way by the work of Werner [91], which we will discuss 
below. The optimality of the Buzek-Hillery and Gisin-Massar transformations was 
first shown by BruB and co-workers in [18], who made use of the relations between 
quantum cloning and the estimation of quantum states. The general idea of this 
approach is the following: One way of estimating an unknown quantum state is to 
first clone it and then perform a state estimation of its clones. But this cannot be 
better than the optimal state estimation, whose fidelity is known for qubits. On the 
other hand, one way of cloning an unknown state is to first estimate it and then 
produce as many copies as desired of the estimated state. But this cannot be better 
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than optimal cloning. Using these two relations, Brufi et al. derived bounds on the 

optimal cloning fidelities which are saturated by the Buzek-Hillery and Gisin-Massar 
transformations. The optimal fidelity for the case of iV to M cloning of qubits is 

NM + M + N 



N-*M 



M{M + 2) 



(1.8) 



It is not hard to see that the Gisin-Massar transformation (1.7) has this copying 
fidelity. In this case the fidelity can be expressed as the mean relative frequency of 
qubits in the original state ip in the final state, i.e. 

^ = E -Y7^^] = Fn^m, (1.9) 
j=o -"^ 

as can be confirmed using the explicit form of the coefficients aj. The Buzek-Hillery 

transformation is contained in the Gisin-Massar transformation as the simplest spe- 
cial case. As mentioned before it leads exactly to F = Fi_>2 = §■ 

Buzek, Hillery and Werner showed that by choosing the states Rj{ip) in the Gisin- 
Massar transformation (1.7) appropriately, it is possible to realize optimal universal 
cloning and the optimal universal NOT operation simultaneously [19]. The ideal 
universal NOT would be an operation that produces the orthogonal complement of 
an arbitrary qubit. Like perfect cloning, this is prohibited by quantum mechanics. 

The understanding of the structure of the cloning transformations was significantly 
deepened by the work of Werner, who proved that the final density matrix of the 
clones in the optimal N to M cloning transformation can be found in the following 
way, apart from normalization: 



where a — is the original state of the N initial qubits, 1 is the completely 

mixed density matrix, and is the projector onto the completely symmetric sub- 
space of the A^-qubit Hilbert space. This is a rather intuitive formula. One can 
say that at the beginning all the information is contained in the N original qubits. 
The M — N auxiliary qubits are completely mixed and thus contain no information. 
Then, the information is distributed over all M qubits in a completely symmetric 
way. The expression (1.7) for the coefficients aj can be easily derived from Eq. 
(1.10). 



1.3 Optimal Universal Cloning Transformations 

The consideration of Werner's formula Eq. (1.10) leads to a better understanding 
of the Buzek-Hillery and Gisin-Massar transformations. Let us start with the case 
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of 1 ^ 2 cloning of qubits. We look for a universal transformation producing two 
approximate copies of the state ip. It is clear that ancillas may be needed. What are 
the basic elements out of which the final state could be built? If, for the moment, 
we take universality to mean that no direction may be singled out by the form of 
the final state, apart from the direction of ■0, then it is clear that the only elements 
permitted are itself, which may appear only linearly (because of the linearity of 
quantum mechanics), and the singlet state of two qubits S — (|0)|1) — |1)|0)). 

The singlet can be rewritten as 5" = (^iljip-^ — ip-^ifj, where ijj-^ — —/3*\0) + a*\l) 
for ^ = a\0) + P\l). The most general form of the final state is then the linear 
combination 



Note that Si2ip3 is not linearly independent. Eq. (1.11) can be rewritten as 



(1.12) 



This manifestly corresponds to a universal transformation because everything is 
expressed in terms of ip and ip^. Its linearity is obvious by construction. The 
case A = B is exactly the Buzek-Hillery symmetric universal doner. The cases 
Ay^ B correspond to asymmetric universal doners [70, 15]. The third particle is an 
anti-clone. 

As a first generalization, the above construction can be extended to the — > M 
cloning case. Now linearity implies that N instances of ip have to appear in the final 
state, which have to be supplemented by M — singlets. Thus, the final state is a 
linear combination of the term 



and its permutations. The Gisin-Massar optimal universal doners correspond to the 
case where the above expression is symmetrized over the first A^ qubits. This gives 
an even more precise sense to our above statement that the information about ip is 
spread out over the M copies in a symmetric way. Originally, the qubits A^+1, . . . , M 
contain no information because they belong to singlet states. Not completely sym- 
metrized linear combinations correspond to general asymmetric doners. 

To generalize the above considerations to the (i-dimensional case, where there are 
no singlet states, one has to note that our above definition of universality was more 
restrictive than the usual one. Let us now only demand that the reduced density 
matrix of each clone be of the form s | ^) | + 1 . Then one can replace the singlets 
in the above construction by the maximally entangled state X — Ylt=i l'^) l'^)- Apart 
from this substitution the calculation remains exactly the same as for qubits. In the 



Aipi S23 + Bil)2 S13. 




■01 . . . IpN 'S'Ar+l,M+l 



■ Sm,2M-N 



(1.13) 
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1 — > 2 case, one has the family of final states 



d d 



(1.14) 



n=l n=l 



To determine the rednced density matrix of the first particle, consider the density 
matrix of the first two: 

d 



Tracing over the second particle, one sees that the reduced density matrix indeed 
only depends on ^iid 1- Again the case A — B corresponds to the general 

universal optimal doners of Buzek and Hillery [22]. The construction generalizes 
to the N M case [23]. Thus we have derived all the optimal universal cloning 
transformations in a simple and unified way. 

We will come back to the above considerations when explaining why our stimulated 
emission doners are optimal in section 1.7. We now turn to a detailed investiga- 
tion of quantum cloning by stimulated emission. The relationship between cloning 
and superluminal communication suggested by Herbert's original proposal will be 
explored in the sequel. 

1.4 Cloning with Lambda— Atoms 

When the Buzek-Hillery and Gisin-Massar transformations were discovered, the 
realization was first discussed in the context of quantum computation, in other 
words in terms of gates. For example, a network for the Buzek-Hillery doner was 
suggested in [20]. However, there is actually a physical process which seems a 
very natural candidate for quantum cloning, namely stimulated emission. This was 
already realized by Herbert [50], who thought however about perfect cloning. It was 
then pointed out by Mandel [61] and Milonni and Hardies [68] that perfect cloning 
is prevented by the presence of spontaneous emission. In the new context of non- 
perfect but universal cloning following Buzek and Hillery, one is led to the question 
of how well cloning via stimulated emission could work. Could it be optimal? We 
will see that the answer is yes. 

Let us first recall the basic facts about stimulated emission by considering the sim- 
plest possible model, namely a two-level system with levels e and g, coupled to a 
single mode a of the electro-magnetic field via the Hamiltonian 





n=l 



H^j{\e){g\a + h.c.), 



(1.16) 
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where 7 is a coupling constant. 

This is known as the Jaynes-Cummings model. The two-level system plays the role 
of inverted medium, i.e. it is supposed to be prepared in the excited state |e). Now 
consider the transition amplitudes for emission of a photon. For early times their 
size is determined by the matrix elements of the Hamiltonian between the respective 
states, i.e. by 



where we have assumed that n photons are already present in node a. From the 
commutation relation [a, a^] = 1 of the bosonic operator a it follows immediately 
that (n + l|a^|n) = \/n + 1. Therefore this matrix element scales like \/n + 1. This 
implies that the two-level system is more likely to emit a photon in a given mode the 
more photons are already present in this mode. The photons that are already present 
stimulate the emission of new photons. This is of course the fundamental principle 
for the working of lasers. Spontaneous emission corresponds to the case where 
originally there are no photons in mode a, i.e. to the matrix element (e|(l|i?|g')|0). 
Sometimes one says that in this case the emission is stimulated by the vacuum 
fluctuations. 

Now imagine an inverted medium that can emit photons of different polarizations. 
Two modes, ai and a2, are sufficient to describe polarization. For the sake of clarity 
and simplicity we are going to neglect the fact that in general there are different 
spatial modes. If the system is in some sense rotationally invariant and one sends in 
an Qi photon, the emission of ai photons which is stimulated should be more likely 
than the emission of 02 photons, which is only spontaneous. This implies that some 
kind of copying should occur. It is also clear that this copying procedure might 
not be perfect since spontaneous emission is going to happen, i.e. in general there 
will be unwanted 02 photons. In fact, the presence of this spontaneous emission is 
unavoidable if one demands universality of the copying, as will become clear below. 
Note that if our inverted medium is really rotationally invariant this should ensure 
the universality of the cloning procedure. In such a situation spontaneous emission 
into both polarization modes will be equally likely, but because of stimulation the 
amplitude for emission into the desired mode will be larger, roughly by a factor 
y/n + 1, where n is the number of incoming photons, as pointed out above. 

We will start by looking at perhaps the simplest model system which has all the 
required properties. It turns out that this is already perfectly suited to achieve 
optimal universal cloning. The inverted medium that we will use as a cloning device 
consists of an ensemble of Lambda-atoms. These are three-level systems that have 
two degenerate ground states \gi) and \g2) and an excited level |e). The ground states 
are coupled to the excited state by two modes of the electromagnetic field, ai and 02, 
respectively. These two modes define the Hilbert space of our qubit to be cloned, i.e. 
we want to clone general superposition states {aai+Pa2)\0, 0) = a\l, 0)+P\0, 1). We 
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can think of ai and 02 as being orthogonal polarizations of one photon with a specific 
frequency, but wc do not have to restrict ourselves to such a specific example, in 
fact we can think about other systems and other degrees of freedom, as long as they 
are described by the same formalism, e.g. ai and a2 could also refer to the center- 
of-mass motion (phonons) in an ion trap. In the interaction picture, after the usual 
dipole and rotating wave approximations, the interaction Hamiltonian between field 
and atoms has the following form: 

/ N N \ 

ni = ^ia,J2\e'){9i\+<^2Y.\^'')i92\]+h.c. 

\ k=l k=l / 

= 7 UiE<i + «2E<2 (1.18) 

\ fc=l k=l ) 

The index k refers to the fc-th atom. Note that in (1.18) the atoms couple to only 
one single spatial mode of the electromagnetic field. In particular this means that 
spontaneous emission into all other modes is neglected. Situations where this is a 
good approximation can now be achieved in cavity QED [49] . We also assume that 
the coupling constant 7 is the same for all atoms, which in a cavity QED setting 
means that they have to be in equivalent positions relative to the cavity mode. 
Trapping of atoms inside a cavity has recently been achieved [93]. Finally note 
that our Hamiltonian has no spatial dependence, which means that the effect of the 
field on the motion of the atoms is neglected, their spatial wave-function is assumed 
to be unchanged. This leads to the question what the spatial wave-function could 
be. The most fascinating possibility would probably be to imagine a Bose-Einstein 
condensate. 

The Hamiltonian (1.18) is invariant under simultaneous unitary transformations of 
the vectors (01,02) and (1(71), 1(72)) with the same matrix U . If one furthermore 
chooses an initial state of the atoms that has the same invariance, then the system 
behaves equivalently for all incoming photon polarizations, i.e. universal cloning is 
achieved. This can be seen in the following way. Consider an incident photon in 
a general superposition state (ctal -|- /9a2)|0,0). Together with the orthogonal one- 
photon state this defines a new basis in polarization space, which is connected to the 
original one by a unitary transformation. If the atomic states are now rewritten in 
the basis that is connected to the original one by the same unitary transformation, 
then under the above assumptions the interaction Hamiltonian and initial state of 
the atoms look exactly the same as in the original basis. The initial state where all 
atoms are excited to |e) has the required invariance: it is completely unaffected by 
the above-mentioned transformations. 

We can therefore, without loss of generality, restrict ourselves to the cloning of 
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photons in mode ai. We consider an initial state 

l*J = ®ti|e')^|0,0), (1.19) 

i.e. we are starting with m photons of a given polarization, and we want to produce 
a certain (larger) number n of clones. 



1.4.1 The simplest case 

For illustrative purposes let us first consider the simplest case of one Lambda-atom 
and one photon polarized in direction 1: 

= |e)4|0,0) = |e)|l,0) =: |j^o) (1-20) 

study the time development, we expand the evolution operator e~*^* into a Taylor 
3S and determine the action of powers of H on the state | . 

n\^,n) = 7{\9i)a\\l,0) + \g2)al\l,0)) 

v3 

n'^l^in) = 7'(|e)aiV2|2,0) + |e)a2|l,l)) = 37'|e)|l,0) = 37'|jro) 



Tos 
series 



(1.21) 



The result is 



= cos(7V3i)|e)|l,0)-zsin(7V3i)(y||^i)|2,0) + y||52)|l,l)) 

= cos(7\/3i)|^o) -«sin(7A/3i)|^i) (1.22) 

\J-'o) and l^i) denote the states of the system atom-photons that lie in the subspace 
with 1 and 2 photons respectively. |^o) is in the subspace where no cloning has 
taken place and in the one where one additional photon has been emitted, so 
that the two photons can now be viewed as clones with a certain fidelity. This way 
of labeling the states will turn out to be convenient below. The probability that the 
system acts as a doner isp(l) = sin^(7\/3t). The fidelity Fi of the cloning procedure 
can be defined as the relative frequency of photons in the correct polarization mode 
in the final state \J^i) (cf. Sec. 1.6). One finds 
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which is exactly the optimal fidelity for a l-to-2 doner, cf. Sec. 1.2. Actually, the 
state 

|^i) = y||2,0)|^i) + ^|l, 1)1^2) (1.24) 
is exactly equivalent to the three-qubit state 

/||ii)U) + /|(:^(|oi) + |io)))|T) (1.25) 

produced by the Buzek-Hillery doner. The equivalence is estabhshed, if the photonic 

states in Eq. (1.24) arc identified with the corresponding symmetrized two-qubit 
states (both photons in mode 1 means both qubits in state |1), one photon in each 
mode means one qubit in state |1), one in state |0)) in Eq. (1.25), while the atomic 
states \gi) and \g2) are identified with the states | I) and | t) of the ancillary qubit. 
This is another way of proving the optimality of Eq. (1.24). Note that in our case the 
universality follows directly from the symmetry of initial state and Hamiltonian, as 
explained above. In the following we show that a similar equivalence holds between 
our cloning scheme and the Gisin-Massar doners in the completely general case 
(arbitrary numbers of photons and atoms). 



1.4.2 Equivalence to coupled harmonic oscillators 

We now turn to the discussion of the general case, i.e. we consider the initial state 
(1.19). We are going to show the equivalence of our system defined by (1.18) and 
(1.19) to a system of coupled harmonic oscillators. First note that both the initial 
state (1.19) and the Hamiltonian (1.18) are invariant under permutations of the 
atoms, which implies that the state vector of the system will always be completely 
symmetric. Furthermore the Hamiltonian (1.18) can be rewritten as 

7i = 7(aiJ+,i + a2J+,2) +/i.c. (1.26) 

in terms of "total angular momentum" operators 

N N 

J-^,r^i:<r = j:\^'){9r\ (^=1,2), (1.27) 

fc=l k=l 

By the above considerations one is led to use a Schwinger type representation [79] 
for the angular momentum operators: 

J+,r^brC^ (r = l,2), (1.28) 

where is a harmonic oscillator operator creating "e" type excitations, while 61 
destroys "g'l" excitations. Note that J+_i and J+_2 share the operator because 
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both ground levels gi and g2 are connected to the same upper level e by the Hamil- 
tonian (1.18), and correspondingly for the Hermitian conjugates. In terms of these 
operators, (1.18) acquires the form 



= ^^^|0) = |m„i,0„2,0w,0b2,iV,) = |m,0,0,0,iV). (1.30) 
\/m.\ \/ N\ 



Actually, for reasons that will become apparent below, it is slightly more convenient 
for our purposes to use the following Hamiltonian instead of (1.29): 



which can be obtained from (1.29) by a simple unitary transformation in mode 6, 
corresponding to a simple redefinition of the atomic states in (1.18). This is the 
Hamiltonian that is going to be used in the rest of this paper. The invariance 
properties of (1.31) are linked to those of (1.18) or equivalently (1.29) discussed 
above: (1.31) is invariant under simultaneous identical SU(2) transformations in 
modes a and h (because the determinant of such a transformation is equal to unity), 
while a phase transformation in either mode can be absorbed into 7. This ensures 
the universality of the cloning procedure. 

We are now dealing with five harmonic oscillator modes defined by the operators 
0,61,62,01, and a2. Action of (1.31) on (1.30) generates Fock basis states of the 
general form 

\{m + j)al-,ia2-,ibl-,3b2-,{N - i - i)c) = \m + j,i)photons\i, j, N - i - j)atoms- (1-32) 

Remember that Oi is now coupled to 62 etc. Expressed in terms of individual atoms, 
\i,j,N — i — i) atoms is the completely symmetrized state with i atoms in level gi, 
j atoms in level gf2, and N — i — j atoms in level e. The correctness of (1.28) can 
be checked by explicit application of left hand side and right hand side to such a 
general state, written in terms of the individual atoms and in terms of harmonic 
oscillator eigenstates respectively. 

As noted above, the action of the Hamiltonian (1.18) on the initial state (1.19) only 
generates completely symmetric states of the atomic system. These states have the 
general form 



Ti-osc = 7(^1^1 + a262)c' + h.c, 



(1.29) 



while the initial state (1.19) is now given by 




H — 7(0162 — 0261)0^ + h.c, 



(1.31) 



( 



N 



) 



1/2 



a.i Oi+i ct 
9x\92 ,---,^2 




a 



iJ,N -i-j) 



atoms 



(1.33) 
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where the sum is over all arrangements a of the N — i — j levels |e), the i levels 
l^fi), and the j levels \g2) on the atoms, and (J^^^ — is the multinomial 

coefficient giving the number of such arrangements. 

Now study the action of a typical term in the Hamiltonian (1.18) on the system 
whose state we will write as 

N -i- j) atoms |m + i,j)photons- 

N 

(E \9l){(^''\)ai\iJ, N - i-j) atoms ® |m + photons 

k=l 

N 



k=l 



i\m + i,j) field 

+ 1 ) i ''^'^^m~^^' E I , ■ ■ ■ , 9V , 9^' , 9^' , 

i\m + ij) field 



r-T f^r~- ■ (^ + l)!J!(iV-^-J•-l)! 

E \9i\ • • • , 9r,9T^\9T^\ e"^) al\m + i,j)fieid 



= ViTl^[^^^^i^j\i + 1J,N -i - j - 1) atoms <^a{\m + ij) field (1-34) 

Here the factor (i + 1) arises from the number of different configurations that a 
given arrangement a can be reached by. This shows that this term acts exactly 
like a term a|6|c. Similar calculations can be made for the other terms in the 
Hamiltonian. Together, they justify the Schwinger representation (1.28). Note that 
the use of the Schwinger representation is only convenient because the initial state 
of the atomic system in (1.19) is completely symmetric under permutation of the 
atoms. 

Studying the Hamiltonian in the form (1.31) instead of (1.18) is helpful in several 
respects. The number of atoms N that is exphcit in the Hamiltonian (1.18) now 
appears only as a part of the initial conditions of our system, which makes it easy to 
treat the general case of N atoms in one go. We will do this in the next subsection. 

The Hamiltonian (1.31) can also be seen as a Hamiltonian for down-conversion with 
a quantized pump-mode described by the operator c, while and br are the signal 
and idler modes respectively, where r labels the polarization degree of freedom. 
Usually in parametric down-conversion the operator c of (1-31) is replaced by a c- 
number. This corresponds to the hmit of a classical pump field. These remarks lead 
to an experimental realization of optimal quantum cloning via stimulated emission 
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which we will discuss in Sec. 1.5. 

In passing we note that the above dynamical equivalence generalizes to atoms with 
more than 2 ground-states \gn) that are coupled each to a different degree of freedom 
of photons an- By similar arguments a system of N identical atoms with r ground 
states (Is'i), . . . , \gr)} governed by a Hamiltonian 

JV r 

^' = 7EE|e')(^^l«n + /i.c. (1.35) 

k=l n=l 

is equivalent to a system of r+l coupled harmonic oscillators with lowering operators 
c and bi, . . . ,br governed by the interaction Hamiltonian 

Ksc-li:cbWn + h.c. (1.36) 

n=l 



1.4.3 Cloning of m photons with N Lambda-atoms: Proof 
of optimality 

We are now going to show that the system defined by (1.30) and (1-31) indeed realizes 
optimal cloning for arbitrary N and m. The idea of the proof is the following. After 
evolution in time the system that started with a certain photon number m will be 

in a superposition of states with difi^erent total photon numbers, where total means 
counting photons in mode ai and 02, i.e. both "good" and "bad" copies. We will 
show that the general form of the state vector after a time interval t is 

N 

\^{t))^e-''''\^,n) = j:MtW), (1.37) 

1=0 

where I denotes the number of additional photons that have been emitted and 



]\{m+l-i)ai,ia2,ibi, {l-i)b2, {N-l)c 



m 

(1.38) 

Note that the number of photons can never become smaller than m since all the 
atoms start out in the excited state. is a normalized state of the system with 
m-\-l photons in total. To see that \!Fi) is properly normahzed note that Y}i=o ("^*) — 

The states are formally identical to the states obtained in [19], which have been 
shown to realize optimal universal cloning and the optimal universal NOT simul- 
taneously. The ideal universal NOT is an operation that produces the orthogonal 



18 



1.4 Cloning with Lambda— Atoms 



complement of an arbitrary qubit. Like perfect cloning, it is prohibited by quantum 
mechanics. The transformation in [19] links universal cloning and universal NOT 
( ant i- cloning): the ancilla qubits of the cloning transformation are the anti-clones. 
In our case, the clones are the photons in the a-modes and the anti-clones are the 
atoms in the fe-modes (atomic ground states gi and g2)- Prom the Hamiltonian 
(1.31) and (1.38) it is clear that for every "good" emitted photon-clone (in mode 
ai) there is an excitation in mode 62 which corresponds to an anti-clone (atomic 
ground state \g2))- The only difference to the states in [19] is the presence of the 
fifth harmonic oscillator mode c, describing the "e" type excitations, which counts 
the total number of clones that have been produced (equal to the number of atoms 
having gone to one of the ground states) and doesn't affect any of the conclusions. 

A distinguishing feature of our doner is that the output state (1.38) is a superpo- 
sition of states with different total numbers of clones. Cloning with a certain fixed 
number of produced copies can be realized by measuring the number of atoms in 
the excited state |e) (corresponding to mode c) and post-selection. 

To prove that the system is indeed always in a superposition of the states \J-'i) as 
in Eq. (1.38) we use induction: The initial state of the system is l^m) = l-^o)- 
Now we will show that if |$) is a superposition of states \J-'i) then 7Y|$) is so, too. 
Then, since |^(t)) = e-*^*|^i^) = Ep^^^ff^l^m) this implies that |^(t)) wiU be a 
superposition of \J-'i)- Explicit calculation shows that 

n\J^i) = -f{^{l + l){N-l){m + l + 2)\J^i+,) 

+^1{N - I + l)(m + I + l)\J^i-i)) 1<1<N 
n\J^o) = 7\/N{m + 2)\Ti) 

'H\Tn) -= -f^JN{m + N +l)\J^N-i) (1.39) 
which completes the proof. 

Note that the form of the coefficients fi{t) didn't play any role in our proof. Actually, 
the fi are in general hard to determine exactly. Solutions have been found in limiting 
cases. For the limit of a classical pump field (c replaced by a c-numbcr), the solution 
can be found by standard methods and is given in Sec. 1.5 in the context of a 
proposed experimental realization of quantum cloning. The solution in the case of 
large incoming photon numbers (m ^ N) can be obtained in the following way. 

For that case, the recursion (1.39) becomes 

n\J^i) = iV^i^il + 1){N- Ol^m) + ^/KN - I + l)|^;-i)) 1<1<N 

HITn) = Tv^ViVl-^iv-i) (L40) 
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It is possible to diagonalize the "transfer" matrix A acting on the vector (/q, . . . , /at) 
that corresponds to the action oiH on \^) — Y^fLo fi\^i)' 



This allows to exponentiate A and to determine the final state of the system after 
a time t: 



1=0 ^ \ / 

Differentiating (1.42) and using (1.40) one can show that this state fulfills Schrodinger's 
equation with the correct initial condition. 

In this big-m-limit the probability to observe the system as an m — > m + / doner 
(i.e. the probabihty that I additional photons are emitted) is 



This is a binomial distribution with a probability sin^(7y^t) for each atom to 
emit a photon. Setting = 1 or comparison with Eq. (1.22) shows that this is 
identical to the probability for the case of only one atom in the case of large m. This 
means that in this limit each atom interacts independently with the electromagnetic 
field, because the effect of the other atoms on the field is neghgible. In the short- 
time limit p{l) = Oit^^). Furthermore the expected average number of "clones" 
Nc — J2iLo lp{l) — N si'D^{■y^/mt) oscillates with an m-dependent frequency. 

Let us pause here for a moment and summarize what wc have found. Our system 
consisting of an ensemble of Lambda-atoms in the excited state is indeed equivalent 
to a superposition of optimal cloning machines a la Buzek-Hillery or Gisin-Massar, 
producing various numbers of clones. The atoms play the double role of photon 
source and of ancilla, the atomic ground states can be identified with the ancilla 
states in the qubit doners. As for the corresponding qubit doners, those ancillary 
atoms can also be seen as the output of a universal NOT gate. On the other hand, 
the atoms that end up in the excited state provide information about the number 
of clones that has actually been produced. This can be used to realize cloning and 
anti-cloning with a fixed number of output clones by post-selection. 

1.4.4 The equivalence between pairs of V— atoms and 
Lambda— atoms 

In this section we present an alternative (but similar) way of realizing optimal uni- 
versal cloning that uses entangled pairs of V-atoms instead of Lambda atoms. We 






(1.42) 




(1.43) 
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prove optimality by showing that the system can be exactly mapped onto the system 
with Lambda atoms that we discussed above. 

The two degenerate upper levels of each V-atom, |ei) and |e2), are coupled to the 
ground state \g) via the two orthogonal modes ai and 02 respectively. The Hamil- 
tonian describing the interaction of atom and field is: 

/ N N \ 

-Hv = 7 4El^')(e?l + 4El^')(e2l +h-c. 

V k=l fc=l / 

/TV N \ 

= 7 4E<i + 4E<2 +h-c. (1.44) 

\ k=\ fe=l / 

It arises from similar assumptions as (1.18). In contrast to before we now choose 
an entangled state of the atoms as the initial state. This is motivated by the fact 
that the initial atomic state has to be a singlet under polarization transformations 
in order for our cloning device to be again universal. 

Let us first examine the simplest case of two entangled V-atoms, A and B, and one 
incoming photon. The initial state of the system is 

l*.n) = ^(|e^f)-|e^f))®|l,0) (1.45) 

Developing the time evolution operator e~^^'^^ into a power series, one finds easily: 

e-^^^*|^,„) = cos(7^t)^^^^^^^|l,0) 
-isin (7^3^) (4^^^^^|2, 0) + yj^^^^^^ll, 1)) (1.46) 
With the substitution 



lefef ) - 


\eief) 


g^ef) - 1 


eig^) 


etg^) - 1 


g^ef) 


V2 



^\gi} 

► m (1.47) 

the state (1.46) has exactly the same form as the corresponding state (1.22) for 
one Lambda-atom, which implies that it also implements optimal universal 1 — > 2 
cloning. 

Actually, the correspondence goes much further. Consider an initial atomic state 
consisting of N pairs of V-atoms, where each pair is in a singlet state: 

\^^)-^tl\e') (1.48) 
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with |e) as defined in (1.47). 

It is easy to see that the action of the Hamiltonian (1.44) on each pair only generates 
one of the three antisymmetric atomic states in Eq. (1.47). Because of the invariance 
of the Hamiltonian under permutations, and in particular under the exchange of two 
atoms belonging to the same pair, transitions between states with different symmetry 
properties are impossible. In fact, with the identification (1.47) the Hamiltonian 
(1.44) has exactly the same form as the Hamiltonian for Lambda-atoms (1.18). The 
analysis made previously for Lambda atoms now goes through unchanged and we 
obtain the same cloning properties of a system of pairwise entangled V-atoms as we 
had before for Lambda-atoms, i.e. we have found another way of realizing optimal 
universal cloning. Although this scheme would without doubt be more difficult to 
realize experimentally, we believe that the underlying equivalence between the two 
systems is interesting and may be useful in other contexts as well. 

1.4.5 Single V— Atoms are sub-optimal doners 

Prom the results of the previous sections, one might be tempted to conclude that the 
fulfillment of the symmetry requirements discussed above already implies optimality 
of the cloning procedure. Here we show that this is not the case by studying an 
explicit example of universal but suboptimal cloning via stimulated emission. 

We are again considering an ensemble of V-atoms where each atom is initially in 
the mixed state 



which is invariant under the same unitary transformations. The invariance of both 
Hamiltonian and initial state together ensure the universality of the cloning pro- 
cedure. Therefore it is sufficient to analyze the performance of the doner for one 
arbitrary incoming one-photon state; we choose = a||0). 

We have performed numerical computations for systems of a few (up to iV = 6) 
atoms. Prom (1.44), the time development operator U — e~*^* for the whole atoms- 
photons system was calculated. Use was made of the fact that A^i and N2, which 
denote the sum of the number of photons plus the number of excited atoms for 
mode 1 and 2 respectively, are independently conserved quantities. Therefore the 
whole Hilbert space is decomposable into invariant subspaces, i.e. H and U are 
block-diagonal. 

The final state of the procedure is an entangled state of the atom-photon system 
that has components with various numbers of photons, where the maximum possible 
total number is A?" -|- 1 (if all atoms have emitted their photons) . The probabihty 
to find k "right" and I "wrong" photons in the final state, denoted by p{k,l), was 




(1.49) 
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calculated for all possible values of k and I and for different values of ^t, and from 
it the overall average "fidelity" 



was determined. This is the average of the relative frequency of photons with the 
correct polarization in the final state. Note that in (1.50) the average is performed 
only over those cases where there are at least two photons in the final state, i.e. 
where at least one clone has been produced. p'{k, I) = p{k, /)/(l — p(l, 0) — p{0, 1)) 
is used in order to have proper normalization. Note that p(0, 0) is always zero. 

That average fidelity for our cloning procedure was compared to the average fi- 
delity that would be achieved by an ensemble of optimal doners producing the same 
distribution of numbers of photons, i.e. to 



where p'{n) — J2 p'{k,l). We also made a comparison to the case, where, in 

k+l=n 

addition to the incoming photon, photons are just created randomly, i.e. to the 
fidelity 



Fig. 1.1 shows clearly that the fidelity of our cloning procedure approaches the 
optimum fidelity for early times. One can also see that for longer interaction times 
f clones departs from fopt and even becomes lower than fraud- This behavior, which 
may seem surprising, is due to the fact that for longer times absorption of photons 
by atoms that have already emitted once and gone to the ground-state becomes 
important. Note that absorption of "right" photons is favored if there are more such 
photons present. In particular, also the incoming "right" photon can be absorbed 
by an atom that has emitted a "wrong" photon before, resulting in departure from 
optimality for later times. The superiority of frand in that regime is understandable 
because in our idealized random doner the incoming photon is always left intact. 

Our computations show that the system goes through many emission-reabsorption 
cycles, though without exhibiting a simple periodicity. As a consequence, over long 
times f clones oscillates taking values above and below frandi sometimes approaching 
/opt again. 

Fig. 1.2, which also illustrates the above-mentioned cychc behavior of our system, 
shows the time dependence of the mean number of photons and of the mean number 
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Figure 1.1: Dependence on time, measured in units of 7^, of fopt, f clones, ^-nd fraud, 
which are the optimum possible fidehty, the fidehty achieved by our V- 
atom cloning procedure, and the fidelity achieved by random photon 
production respectively, as defined in Eqs. (1.51,1.50,1.52), for the case 
oi N — 6 atoms. It is evident that optimal cloning is achieved in the 
short-time limit. The behavior for lower atom numbers is the same. 
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Effective time 



Figure 1.2: Time dependence of the mean number of all photons N^u and of the 
mean number of "right" photons (i.e. of the same polarization as the 
incoming photon) N right for the case N — 6. 

of photons of the correct polarization. For short times, which is the interesting 
regime from the point of view of cloning, the probability for every individual atom 
to have already emitted its photon is low. Therefore, in order to produce a reasonable 
average number of clones in this regime, a large number of atoms is necessary. 

Our results show that symmetry alone is not sufficient to achieve optimal cloning. 
But note that even in this case optimality is approached for short interaction times. 

1.5 Experimental realization 

Here we propose a concrete experimental realization of the ideas discussed in the pre- 
vious sections. The scheme for quantum cloning that we want to present is based on 
stimulated parametric down-conversion (PDC). We will show that optimal cloning 
can be reahzed with present technology. In PDC a strong hght beam is sent through 
a crystal. There is a certain (very low) probability for a photon from the beam to 
decay into two photons such that energy and crystal momentum are conserved. In 
typc-11 PDC the two photons that are created have different polarization. They are 
denoted as signal and idler. 

Fig. 1.3 shows the setup that we have in mind. We consider pulsed type-II frequency- 
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Figure 1.3: Setup for optimal cloning by parametric down-conversion [61, 35, 67]. 



The pump-pulse is split at the beam splitter BS. One part of the pump 
pulse hits the first crystal Ci, where photon pairs are created with a 
certain rate. One photon from each pair can be used as a trigger. The 
other photon is the system to be cloned. This photon is directed towards 
the second crystal C2, where it stimulates emission of photons of the 
same polarization along the same direction. The path lengths have to 
be adjusted in such a way that the DC-photon and the second part of 
the pump pulse reach C2 simultaneously. The photons in mode 1 are 
optimal clones of the incoming photon, and the photons in mode 2 are 
the output of an optimal universal NOT-gate. It is interesting to note 
that in this scheme one is actually cloning a photon that is part of an 
entangled pair. 
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degenerate PDC. It is possible to choose two conjugate directions for the signal and 
idler beams such that photon pairs that are created along these two directions are 
entangled in polarization [59]. We consider the quasi- collinear case (i.e. the two 
directions almost coincide), so that the transverse motion of the photons in the 
crystal is not important. 

For stimulated emission to work optimally, there has to be maximum overlap of 
the amplitudes of the incoming photon and of all the photons that are produced in 
the second crystal. This can be achieved by using a pulsed scheme together with 
filtering of the photons before detection [95]. The pump pulse can be seen as an 
active volume that moves through the crystal. If the photons arc filtered so much 
that the smallest possible size of the wavepackets detected is substantially bigger 
than the pump pulse, then there is maximum overlap between different pairs created 
in the same pulse. Of course, filtering limits the achievable count rates. Moreover 
the group velocities of pump pulse, signal {V) and idler (H) photons are not all 
identical. This leads to separations (of the order of a few hundred fs per millimeter 
in BBO), which have to be kept small compared to the size of the DC-photon wave 
packets. There is a trade-off between filtering and crystal length, i.e. one can choose 
narrower filters in order to be able to use a longer crystal (which leads to longer 
interaction times). 

If the above-mentioned conditions are fulfilled, then a single spatial mode (i.e. one 
mode for the signal and one for the idler photons) approximation can be used. The 
PDC process can then be described in the limit of a large classical pump pulse, in 
the interaction picture, by the Hamiltonian 



where Oyi is the creation operator for a photon with polarization V propagating 
along direction 1 etc. The coupling constant and the intensity of the classical pump 
pulse are contained in 7. As discussed above this Hamiltonian corresponds to the 
limit of the A-atom Hamiltonian (1.18) for a coherent state of mode c. 

The Hamiltonian H is invariant under simultaneous general SU (2) transformations 
of the polarization vectors {oy, a'^) for modes 1 and 2, while a phase transformation 
will only change the phase of 7. This makes our doner universal, i.e. its performance 
is polarization independent. Therefore it is again sufficient to analyze the "cloning" 
process in one basis. 

The time development operator e^*-^* clearly factorizcs into a VI — H2 and an 
HI — V2 part. Consider cloning starting from N identical photons in the initial 

(a' 

state = — 7=p|0) Making use of the disentangling theorem [89, 31] one finds 



H = 7(ayiaif2 - am«y2) + h.c, 



(1.53) 
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where F = tanh and K is a, normahzing factor. This reproduces the results of the 
previous section cf. Eq. (1.37,1.38). We see, that in this limiting case it is possible 
to determine the coefficients fi{t) of Eq. (1.37) explicitly. 

The component of this state which has a fixed number M of photons in mode 1, is 
proportional to 



This is identical to the state produced by the unitary transformation written down 
in [19] which can be seen as a special version of the Gisin-Massar doners [44] that 
implements optimal universal cloning and the optimal universal NOT-gate at the 
same time. The M photons in mode 1 are the clones, while the M — N photons 
in mode 2, which act as ancillas for the cloning, are the output of the universal 
NOT-gate, the "anti-clones" . 

This means that the setup of Fig. 1.3 works as an ensemble of optimal universal 
cloning (and universal NOT) machines, producing different numbers of clones and 
anti-clones with certain probabilities. Note that each of the modes can be used as a 
trigger for the other one and therefore cloning or anti-cloning with a fixed number 
of output-systems can be realized by post-selection. 

We have shown a method of realizing optimal quantum cloning machines. We em- 
phasize that this scheme should be experimentally feasible with current technology. 
In our group, pair production probabilities of the order of 4- 10~^ have been achieved 
with a 76 MHz pulsed laser system (UV-powcr about 0,3 W) and a 1 mm BBO crys- 
tal, for 5 nm filter bandwidth. Past experiments show that good overlap of photons 
originating from different pairs is achieved under these conditions. With detection 
efficiencies around 10 percent, this leads to a rate of two-pair detections of the order 
of one per a few seconds. 

Let us note that experiments in the spirit of the present proposal are currently under 
way in at least two laboratories [35, 14]. First results were reported in [36]. 




M-1)vi\1)hi\1)v2\M-N-1)h2. 



(1.55) 
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1.6 Cloning of photons versus cloning of qubits 

In this section we are going to discuss the physical differences that exist in spite of 
the formal equivalence proven above between our photon doners based on stimulated 
emission and the qubit doners as usually considered [21, 44]. In particular, we will 
show that the claim that optimal cloning is realized by our devices is justified in 
spite of these differences. 

In most of the previous work cloning was discussed in terms of quantum networks. 
In general, the situation considered in these papers is the following: one has a 
certain number of qubits that are localized in different positions, which makes them 
perfectly distinguishable. At the beginning, some of those qubits arc the systems to 
be cloned, the others play the role of ancillas. After the cloning procedure, which 
consists of several joint operations on the qubits that can be expressed in terms of 
quantum gates, some of the qubits are the clones, the rest are ancillas, which for 
a specific form of the optimal cloning transformation can also be seen as outputs 
of the universal NOT operation. As a consequence of localization, it is possible to 
address individual clones. 

In our stimulated emission doners, the situation is different. All input systems 
(photons) are in the same spatial mode (called mode a) , and, even more importantly, 
all clones are produced into that mode. Note that this is completely unavoidable if 
stimulated emission is to be used. One can say that this is the price one has to pay 
for the great conceptual simplicity of the cloning procedure itself. 

However, having all clones in the same spatial mode is not necessarily an impor- 
tant disadvantage. For example, if perfect cloning of that kind were possible, one 
could still determine the polarization of the original photon to arbitrary precision by 
performing measurements on the clones. This would still make superluminal com- 
munication possible. It may be interesting to note that in the paper that started 
the whole discussion about quantum cloning, Herbert [50] considered cloning via 
stimulated emission and therefore necessarily into a single spatial mode. 

If one wants to distribute the clones to different locations, this can for example be 
achieved using an array of beam splitters. However, this does not lead to a situation 
where one can be sure to have exactly one photon in each mode. If one wants to 
have at most one photon in each mode, the array has to have many more output 
modes than there arc photons. 

Another distinguishing feature of our doners compared to the usual qubit doners 
is the fact that the same procedure is used to produce different numbers of clones. 
While in the qubit case the network to be used depends on the number of desired 
clones, in our case the final state is a superposition of states with different numbers 
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of clones. Of course, the average number of clones produced depends on the number 
of atoms present in the system and the interaction time. As discussed in Sec. 1.4 
cloning with a fixed number of output clones can be achieved by post-selection based 
on a measurement of the number of excited atoms in the final state. 

The formal equivalence between the qubit doners and our one-mode doners can 
arise because the output state produced by the optimal qubit doners is completely 
symmetric under the exchange of clones [21, 44]. Because of the bosonic nature 
of the photons there is a one-to-one-mapping between completely symmetric qubit 
states and photonic states. Note that asymmetric cloning could not be realized by 
the presented method. For a completely symmetric qubit state the two concepts of 
relative frequency of qubits in the "correct" basis state and of single-particle fidelity 
are equivalent. This can be seen in the following way. Let 1-0) denote the state that 
is to be copied. Then the usual definition of the (single-particle) cloning fidehty is 

F={lP\pred\^), (1.56) 

where Pred is the reduced density matrix of one of the clones, say the first one, i.e. 

Pr-ed = Tr2,3,...,v[p] (1-57) 
Then F can also be expressed as 

F = Tr[p |^)(V'|i®/2®...®/v]. (1.58) 

On the other hand, the relative frequency of qubits in the state can be written 
as 

-^TV [p (|V')(V'|i ®h® ... ®lN + h® |V')(V'|2 ® ... /jv + ... + h® ... ® \^){^\n)] . 

(1.59) 

If p is invariant under exchange of any two clones, it is obvious that (1.59) is equal to 
(1.58). i.e. for symmetric doners the two concepts are completely equivalent. This 
justifies our definition of fidelity via the relative frequency in the case of photon 
cloning (cf. Sec. 1.4). 

Let us finally address the issue of optimality in the context of stimulated emission 
doners. In this paper we have shown the formal equivalence of our scheme and 
the optimal schemes for qubit cloning. As a consequence, the fidelity of the clones 
saturates the bounds derived for the cloning of qubits. However, it is not entirely 
obvious that the bounds derived for the situation of distinct well-localized qubits 
also apply to our situation. Could one maybe achieve even higher fidelity in our 
one-mode case? The following argument shows that the bounds indeed apply in our 
situation as well, i.e. that photon cloning is not allowed to be better than qubit 
cloning. 
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Let us assume that we had a single-mode cloning machine that clones photons with 
a better fidelity than given by the bounds for qubits. Consequently, the relative 
frequency of "correct" photons has to exceed the bound for at least one value of the 
final total photon number M. This is obvious if M has been fixed by post-selection. 
Otherwise the fidehty has to be defined as the average of the relative frequencies over 
all final total photon numbers. This average can only exceed the bound for qubits if 
the bound is violated for at least one particular value M of the final photon number. 

As a consequence, we have a universal map from the A'"-photon Hilbert space to the 
M-photon Hilbert space that achieves a relative frequency of correct photons in the 
final state that is higher than the qubit bound. But the existence of such a map is 
equivalent to the existence of a universal map from the totally symmetric A^-qubit 
space to the totally symmetric M-qubit space with a single-particle fidelity equal to 
the relative frequency. The existence of the latter map is excluded by the theorems 
on cloning of qubits [91]. This justifies our claim that the schemes presented in the 
previous sections realize optimal cloning of photons. 



1.7 Why are our doners optimal? 

On the previous pages we have shown that optimal cloning can indeed be realized 
by stimulated emission. The states produced by our simple quantum optical model 
systems consisting of two photonic modes and an ensemble of Lambda systems are 
exactly the same as those derived by Buzek and Hillcry and Gisin and Massar. But 
why is that so? An element of wonder seems to remain. It follows from symmetry 
considerations that our systems should act as universal doners, but a priori opti- 
mahty was not necessarily to be expected. We have seen in subsection 1.4.5 that 
not all systems which have the required symmetries also lead to optimal universal 
cloning. 

The reasons behind the optimality of our cloning procedure can be understood by 
remembering our construction of the optimal universal cloning transformations in 
section 1.3, where we saw that the output of the optimal doner is given by the 
projection of the state 

Ipl . . .ll)N Sn+1,M+1 ■ ■ ■ Sm,2M-N (1.60) 

onto the completely symmetric subspace of the first Af qubits, where S can be 
replaced by the maximally entangled state ^(|0)|0) -|- |1)|1)). 

On the other hand, one can show that the final states of our stimulated emission 
doners are always linear combinations of states of the following form: 

(al&l + 46^)^-V|0), (1.61) 
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where the c modes are disregarded because they just count the number of photons, 
as explained above. Here an initial state with N photons in mode 1 was assumed 
and we have already chosen the exponent M — in such a way as to facilitate the 
comparison to (1.60). That the final state can indeed be expressed in terms of (1.61) 
can be seen by noting that all terms generated during the time evolution will be of 
the form 

Wi'^IO) (1.62) 

for some k, where 

H ^ -fc{a{b\ + albl) + h.c. (1.63) 

Furthermore the commutator 

[aibi + 02^2, albl + albl] = iV„ + iVf, + 1, (1.64) 

where Na = a\ai + 0302 etc. This means that we can get rid of all annihilation 
operators by commuting them to the right. 

JrN 

The similarity between the two expressions (1.60) and (1.61) is obvious. The a[ in 
(1.61) corresponds to the N instances of ip in (1.60), while the (alfci + 46^)*^-^ cor- 
responds to the M—N instances of S. The projection onto the completely symmetric 
subspace is built in automatically in (1.61) through the commutation properties of 
the bosonic operators. This intuitive explanation can be checked by explicitly eval- 
uating the completely symmetrized component of (1.60), and (1.61). One verifies 
that the density matrices of the a-modes arc indeed identical in both cases. Thus 
we have finally understood the formal equivalence between our stimulated emission 
cloning procedures and the classical optimal doners. 



1.8 Conclusions and Outlook 

One may feel that with the remarks in the previous section the work is really com- 
pleted. Cloning via stimulated emission has been shown to be realizable, and the 
formal reasons for its optimality are understood. Furthermore, we proposed a con- 
crete experimental realization which should lead to results in the near future. The 
initial intuition probably shared by many physicists that a gain medium is some- 
thing like a doner was thus shown to be entirely correct. Our study demonstrates 
the intimate connection between the apparently deeply quantum-field-theoretical 
concept of stimulated emission and the quantum-information concept of cloning. It 
is the author's hope that there may be more things to learn about quantum field 
theory by looking at it from a quantum- information point of view. 

Another moral of the present work refers to the technology of quantum information. 
It reminds us that at least for specific tasks there may sometimes be more natural 
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and therefore possibly also more practical implementations than quantum comput- 
ing networks. Concerning possible practical applications of the present work it is 
worth mentioning that the optimal universal doner constitutes the optimal eaves- 
dropping method in some protocols for quantum cryptography [6] , so conceivably a 
future Eve could rely on stimulated emission. We have discussed a possible imple- 
mentation using parametric down-conversion in some detail. Other implementations 
might be possible, most notably based on combining cavity QED and Bose-Einstein 
condensation. It should be mentioned that parametric down-conversion-like Hamil- 
tonians can be realized for BECs, which makes them natural candidates for the 
implementation of the cloning of atomic states. 
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2.1 Introduction 

The special theory of relativity is one of the cornerstones of our present scientific 
world-view. One of its most important features is the fact that there is a maxi- 
mum velocity for signals, i.e. for anything that carries information, identical to the 
velocity of light in vacuum. Within the special theory of relativity, superluminal 
communication would immediately lead to all kinds of causal paradoxes, e.g. one 
would be able to influence one's own past. 

Another cornerstone of our present understanding of the world is quantum physics. 
Quantum physics seems to have "nonlocal" characteristics due to quantum entangle- 
ment. Most importantly, it is not compatible with local hidden variables, as shown 
by the violation of Bell's inequalities [7], which has been experimentally confirmed 
in several experiments [5, 90]. 

It is very remarkable that in spite of its non-local features, quantum mechanics 
is compatible with the special theory of relativity, if it is assumed that operators 
referring to space-like separated regions commute. In particular, one cannot ex- 
ploit quantum-mechanical entanglement between two space-like separated parties 
for communication of classical messages faster than light [40] . 

This peaceful coexistence between quantum physics and special relativity has led 
physicists to ask whether the principle of the impossibility of superluminal commu- 
nication, which we will refer to as the "no-signaling condition", could be used as 
an axiom in deriving basic features of quantum mechanics. Here we show that it 
is indeed possible. If the usual kinematical characteristics of quantum mechanics 
are assumed, then its dynamical rules can be derived from the no-signaling assump- 
tion. By quantum kinematics we mean the following: the states of our systems are 
described by vectors in a Hilbert space, and the usual rules for the results of measure- 
ments apply, including the projection postulate. However, no a priori assumption is 
made about the time evolution of the system. 

Our result is then, more precisely, that under the stated conditions the dynamics 
of our system has to be described by completely positive (CP) linear [77] maps. 
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This is equivalent to saying that under the given assumptions quantum mechanics is 
essentially the only option since according to the Kraus representation theorem [77] , 
every CP map can be realized by a quantum-mechanical process, i.e. by a unitary 
(linear) evolution on a larger Hilbert space (while on the other hand any quantum 
process corresponds to a CP map). This rather surprising result is an extension of 
earlier work by N. Gisin [41]. 

In the following, we will first recall how super luminal communication is impossible 
in quantum mechanics in spite of the existence of entangled states, as a consequence 
of the linearity of quantum dynamics. Then we show that quantum kinematics and 
no-signaling imply quantum dynamics. The argument proceeds in two steps. Firstly, 
it is shown that the existence of entanglement, the projection postulate and the no- 
signaling constraint imply linearity. Secondly, complete positivity of the dynamics 
follows from the the existence of entanglement and linearity. 

2.2 No— signaling in Quantum Mechanics 

Consider two parties, denoted by Alice and Bob, who are space-like separated, which 
implies that all operations performed by Alice commute with all operations per- 
formed by Bob. (Throughout this work we will assume that in this sense locality is 
implemented in the quantum kinematics.) Can they use a shared entangled state 
ipAB in order to communicate in spite of their space-like separation? 

The short answer is: no, because the situation on Bob's side will always be described 
by the same reduced density matrix, whatever Alice chooses to do. All the effects 
of her operations (described by linear maps) disappear when her system is traced 
over. This answer is correct, but not very detailed, and thus it may not be entirely 
convincing. In particular, a question that is frequently raised in this context is the 
following: Alice could choose to measure her system in two different bases and thus 
project Bob's system into different pure states depending on the basis she chose and 
her measurement result. Since it is possible to distinguish two different states in 
quantum mechanics, at least with some probability, shouldn't it be possible for Bob 
to infer her choice of basis, at least in some percentage of the cases (which would be 
dramatic enough)? 

Of course, the answer is no again, for the following reason. In order to gain in- 
formation about which basis Alice chose to measure. Bob can only perform some 
(generalized) measurement on his system. Then he has to compare the conditional 
probabilities for this result to occur, for the case that Alice measured in the first 
or in the second basis. But these conditional probabilities will always be exactly 
the same for both possibilities. This can be seen as a consequence of the linearity 
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of quantum mechanics: Suppose that Ahce's first choice projects Bob's system into 
states ipi with probabihties Pi and her second choice projects it into states 0^ with 
probabihties g^^. Bob can calculate the probability for his obtained result in every 
one of the states, and then weight these probabilities with the probability to have 
this specific state. But because of the linearity of any operation that Bob can per- 
form on his states during his generalized measurement procedure, his final result will 
only depend on the density matrix of the probabilistic mixtures, which is the same 
in both cases, because they were generated from the same entangled state. For an 
example how two such mixtures can become distinguishable through a non-linear 
(non-quantum-mechanical) evolution, see [42]. 

Let us note that this argument also implies the non-existence of a perfect doner in 
quantum mechanics because such a machine would allow superluminal communica- 
tion [50]. The impossibihty of perfect cloning can also be shown directly from the 
linearity of quantum mechanics [92]. 

2.3 No-signaling and Linearity 

In this section we show how quantum dynamics can be derived from quantum kine- 
matics and the no-signaling condition. By quantum kinematics we mean that the 
usual Hilbert space-structure (including entanglement) and the projection postu- 
late are assumed. The probabilities of the results of measurements are assumed to 
be determined by the density matrices of the systems in the usual way. Thus if we 
consider a subsystem of the whole Universe it will in general be in an entangled state 
with other parts of the Universe. In particular, it may also happen that a system 
denoted by A is entangled with another system B which is space-like separated with 
respect to A, such that their observable algebras do commute. This is where the 
no-signaling constraint comes into play. The dynamics of the systems has to be such 
that in spite of this entanglement no superluminal communication between A and 
B is possible. 

Suppose that A and B together are in the entangled state \iP)ab with reduced 
density matrix pA for system A. As a consequence of the projection postulate, by 
performing a measurement of his system the observer B also prepares a certain state 
m A. In particular, every probabilistic mixture of pure states corresponding to the 
density matrix pA can be prepared via appropriate measurements on B (for a proof 
see Sec. (2.4) and Ref. [52]). 

Consider two such probabilistic mixtures {P^^,Xi} and {-P^^j^/j}, where Py,^. is the 
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projector corresponding to the pure state jz/'fc) and is its probabihty, such that 

^i^A = 12 VjPh = Pa- (2-1) 

i 3 

According to the no-signahng principle there should be no way for the observer in 
A to distinguish these different probabilistic mixtures. 

A general dynamical evolution in system A is of the form 

g:P^^ g{P^) (2.2) 

where, most importantly, g is not necessarily linear. Furthermore, g{P^) does not 
have to be a pure state. Firstly, it could evolve into a mixed state. Secondly, if 

■0 evolves into a probabilistic mixture, then we define g{P^) to denote the corre- 
sponding density matrix. Under such dynamics the probabilistic mixture {P^^^jXk} 
goes into another probabilistic mixture {g{P^^,),Xk}- Therefore the two final density 
matrices after the action of g on two different probabilistic mixtures {P^^,Xi} and 
{P^j,yj} are 

i 

p'A{P<t>,,y3}-Ey39(P<t>,) (2.3) 

3 

which a priori can be different. Let us recall that according to our assumptions the 
results of all measurements in A are determined by the reduced density matrix p^. 
This means that as a consequence of the no-signaling principle the density matrix 
p'j^ at any later time has to be the same for all probabilistic mixtures corresponding 
to a given initial density matrix p^. That is, it has to be a function of pa only. 

We can therefore write 

PA = 9{PA)=g{J2^^PA)■ (2-4) 
i 

From the above it also follows that 

PA = J2^i9{Pi^J, (2.5) 

i 

therefore g satisfies the condition 

9il2^^PA)=12^i9{PA), (2.6) 

i i 

which implies that the map g is linear. Let us stress that there arc three crucial 
ingredients in our argument: the existence of entanglement, the projection postu- 
late, and the no-signaling condition. Specifically, the projection postulate leads to 
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probabilistic mixtures and thus to the right-hand side of Eq. (2.6). On the other 

hand, the no-signaling condition tells us that the dynamics can depend only on the 
reduced density matrix, which leads to the left-hand side of Eq. (2.6). Positivity is 
necessary in order to ensure that g{pA) is again a valid density matrix, i.e. to ensure 
the positivity of all probabihties calculated from it. 

As we have made no specific assumptions about the system A (apart from the fact 
that it can be entangled with some other system), this means that the dynamics of 
our theory has to be linear in general. 

Let us now argue that linearity and positivity already imply complete positivity in 
the present framework. To see this, consider again two arbitrary subsystems A and 
B which may again be in an entangled state \ijj)AB- Now it is conceivable that system 
A is changed locally (i.e. the system evolves, is measured etc.), which is described 
by some operation qa, while nothing happens in B. This formally corresponds to 
the operation qa® on the whole system. Strictly speaking, we have made an 
additional assumption here, namely that the identity operation 1b on a subsystem 
is a valid physical operation. 

The joint operation qa ® 1b should take the density matrix of the composite system 
Pab into another valid (i.e. positive) density matrix, whatever the dimension of the 
system B. But this is exactly the definition of complete positivity for the map qa [77]. 
If Qa is positive but not CP, then by definition there is always some entangled state 
Pab for which qa ® 1b{pab) is no longer a positive density matrix and thus leads to 
unphysical results such as negative probabilities. Let us recall that transposition of 
system A is an example for a positive but non-CP map. 

In this way the existence of entangled states and the requirements of positivity 
and linearity actually force us to admit only completely positive dynamics. As 

mentioned already in the introduction, this is equivalent to saying that under the 
given assumptions quantum dynamics is essentially the only option since any CP 
map can be realized by a quantum mechanical process, and on the other hand, any 
quantum-mechanical process corresponds to a CP map. 

Let us recall once again our starting assumptions: these were the existence of entan- 
glement, the projection postulate, the no-signaling condition, and, strictly speaking, 
the assumption that the identity operation on a subsystem is a permitted dynamical 
evolution. Nonlinear modifications of quantum mechanics [34] have to give up at 
least one of these assumptions. For instance, if the dynamics is allowed to depend 
only on the reduced density matrix pA, but in a nonlinear way, then it is clear that 
Pa cannot correspond to a probabilistic mixture of pure states. But pa will corre- 
spond to such a mixture whenever the observer in B chooses to make an appropriate 
measurement, as long as we believe in the projection postulate. This implies that 
the projection postulate has to be modified in such a nonlinear theory. Another 
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example would be a theory where some entangled states are a priori excluded from 
the kinematics. In this case some non-CP maps might be permissible. An extreme 
example would be a theory without entanglement. Such a theory would of course 
be in conflict with experiments. An example for a linear, positive, but non-CP map 
consistent with the no-signaling condition is the transposition of the density matrix 
of the whole Universe. However in this case the identity operation on a subsystem 
is not an allowed dynamics. 



2.4 Preparation of any mixture at a distance 

Let us now show that any mixture corresponding to a given density matrix can be 
prepared at a distance from any entangled state with the correct reduced density 
matrix [41, 52]. Let us denote the system under consideration by A and the remote 

system by B. An immediate requirement on the state of the joint system \iP)ab in 
order to achieve this is that it needs to have the correct reduced density matrix pA- 
Let us denote the eigenvector representation of pA by 

i2^k\vk){vk\. (2.7) 

k=l 

Then \iP)ab must have a Schmidt decomposition 

r 

\^)AB^T.y>^k\Vk)\9k): (2.8) 
k=l 

where the \gk) are orthonormal states of system B. We want to show that any 
decomposition of pa as a mixture of pure states can be prepared from this state by 
operations on system B only. To this end, consider an arbitrary decomposition 

m 

= (2.9) 

i=l 

where in general m > r. Clearly this decomposition could be obtained from a state 

m 

\cP)AB = Y.V^^\^^)\^^)^ (2-10) 
i=l 

with the \ai) being an orthonormal basis of a m-dimensional Hilbert space H^. It 
seems that we now require a larger Hilbert space in location B in order to accommo- 
date all the orthonormal jaj). But the state also has a Schmidt representation 

\<l>)AB = j2^k\Vk)\hk), (2.11) 
k=l 
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which imphes that and \iI^)ab are connected by a unitary transformation on 

B alone: 

\(t>)AB=lA®UB\ll^)AB, (2.12) 

SO the dimension of the support of the reduced density matrix Pb is the same for 
both states. This means that \(t))AB can be rewritten as 

m 

\4>)ab = Ps\<P)ab = Y.^i\A)\&i). (2.13) 

where \a.i) = Ps\oii) and Ps is the projector onto the support of ps, i.e. the span of 
the vectors \hk). The vectors \6Li) live in this r-dimensional span. Let us recall that 
\(f>)AB can be generated from by the local unitary Us- 

The states I'^j) can now be prepared with the help of the POVM 

m 

El«0(«il = l (2.14) 

i=l 

on system B. Note that the \a,^ form a POVM by construction: They can be 
extended to the orthonormal basis on the larger Hilbert space H^n- The 

proof of our above statement uses exactly this extension. For a general POVM 
XI-P/ = 1 on system B the result i prepares the state p\ — TiCB{Pi)BPAB in system 
A. Therefore the state prepared by the result corresponding to \ai) is equal to 
p\ — TrB\o(i){ai\\(l))AB{'t>\AB- which can be rewritten, formally extending the Hilbert 
space dimension for system B, as 

TrBPs\ai){ai\Ps\4>)AB{(p\ = TrB\ai){ai\\(f))AB{(f>\ = Xi\iPi){iPi\. (2.15) 

This completes the proof of our above statements [77]. Any decomposition of Pa in 
a mixture of pure states can be obtained from the state lip) ab- 

2.5 Bounds on cloning from positivity and 
linearity 

The motivation for the present section is partially historical. In [43] the no-signahng 
condition in the form of [41], which was just positivity and linearity, was used to 
derive a bound on the simplest doner. According to our above argumentation, all 
of quantum dynamics can be derived from the no-signaling condition, so the bounds 
from no-signaling are identical to the quantum-mechanical bounds, which in the case 
of cloning are known [18, 91]. Nevertheless, the method of [43] is quite convenient 
for deriving bounds on universal quantum machines [84] . We illustrate it here for the 
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case oil ^ N cloning, thus providing an ahernative (maybe more direct) derivation 
of the corresponding bounds. We will first recall Gisin's treatment of the 1 — > 2 
doner, then we generalize to the 1 ^ A'" case. 

Let the initial state of the input qubit be denoted by po = + -m) = \+m){+rh\, 
where a-m\ +m) = +\m). The output two-qubit density matrix is denoted by p{m). 
From the discussion of the preceding section we know that p(m) has to be a linear 
function of m. 

The output density matrix p(m) is further constrained by the requirement of uni- 
versality, which takes the form 

p{Um) ^U® Up{m)U^ ® (2.16) 

for all unitary operators U. This implies that p{rh) depends only on m and on no 
other privileged direction. Thus, if p(m) is written in the basis of matrices 

1 ® 1,1 ®ai,ai®l,ai®ak, (2.17) 

the coefficients can only depend on the components of m and on the invariant 
tensors 5ij and eij^. 

It is thus necessarily of the form: 

p{m) = ^ (1 (g) 1 + r]ima (g) 1 + 772I <8) ma + t a ®a + txym{a A a)) (2.18) 

where rii,ri2,t,txy are real parameters. In order for p(m) to be a physical density 
matrix, its eigenvalues have to be non-negative. A simple calculation shows that 
this implies 

l + t±{r]i + r]2) > 
l-t± ^4^2 + itiy + (r/i - ri2y > (2.19) 

In the case of symmetric cloning, the task is to optimize the fidelity F — Tr{p{m) P^® 
1), where = | -|- m){-\-m\, assuming rji — r]2 = rj. A simple calculation leads to 
the optimal values txy = O.t = 1/3, 77 = 2/3, for which F = | . Note that this also 
optimizes Tr{p{m) (g) P^h) = |- These are exactly the bounds that are valid in 
quantum mechanics, cf. Sec. 1.2. 

Now we show, that the above result can be generalized to the case of 1 — > iV 
cloning. Firstly, from any non-covariant and non-permutation invariant cloning 
transformation that produces a number of copies that are scaled versions of the 
input qubit with identical scaling factors, by averaging over unitary transformations 
and permutations one can get a covariant and permutation invariant transformation 
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without affecting the quahty of the copies [57] . This means that in deriving bounds 
we can restrict our attention to covariant and permutation invariant output density 
matrices. 

Secondly, as shown above, it follows from no-signaling that pout has to be linear in 
the Bloch vector of the input qubit. 

Thus for building the output density matrix we are only left with rrii and 5jk as 
possible coefficients in the Pauli matrix representation, where rrii can only occur 
linearly. The invariant tensor eijk is excluded by the requirement of permutation 
invariance. Possible terms are: 

1 1 (8) ... (g) 1 
(Tfe (8) CTfc (g) 1 (g) ... (g) 1 + perm. 
(Jfe <8) CTjfc <g) (7; (g) (7/ (g) 1 (g) ... (g) 1 + perm. 

a-m(g)l(g)...(g)l + perm. 
a • m (g) CTfc (g) CTfc (g) 1 (g) ... (g) 1 -\-perm. 

(2.20) 

where summation over repeated indices is understood. Because of universality we 
can choose m e.g. along the z-axis. Then one can convince oneself that all the above 
terms can be generated by products of 

Jz^]^{(Tz®'^® ■■■®^ + ■■■) (2.21) 

and 

J = JkJk, (2.22) 

the z-component of the total angular momentum and its square respectively. can 
occur only linearly, while for higher powers are possible: one has to distinguish 
the cases of even and odd. For N = 2k {J'^Y^ can go up to n = A; and J^(J^)" 
up to n = A; — 1, while for = 2A; + 1 (J^)" can go up to n = A; and Jz{J'^y^ up to 
n = A; as well. Higher powers are linearly dependent. We will denote the maximum 
possible values as n^ax a-nd n^^^ in the following. Note that n^ax + ^mao; — N — 1. 

Therefore the most general output density matrix can be written in the following 
way: 

n' 

^max max 

Pout = /?0l + E f^niJ'T + "0^. + E ^nJziJ'T. (2.23) 
n=l n=l 

where 1 now denotes the unit matrix in the A'"-particle Hilbert space. Our task is 
to find coefficients ccj and /3j such that the scaling factor of an individual clone is 
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maximal. The constraints are given by the requirements of positivity and normal- 
ization: all eigenvalues of Pout have to be positive, and its trace has to be equal to 
unity. 

In order to express the positivity constraint one has to diagonalize the matrix, but 
in the present formulation this is trivial. The eigenvectors are just the angular 
momentum eigenvectors |j, m, 7j), where 7^ runs over the different irreducible rep- 
resentations for a given j, i.e. jj = l,...,dj, where dj denotes the number of irr. 
reps for j. This means that the eigenvalues are given by 

n' 

A(j, m) = /3o + MU + 1))" + «om + E <^nm{j{j + 1))". (2.24) 

n=l n=l 

(The eigenvalues do not depend on 7^ .) Positivity implies that 

A(j,m)>0 Wj,m. (2.25) 
The normalization constraint is 

Trp„„, = /3o2^ + E PnTrif'r = h (2-26) 

n=l 

because the trace of the terms with is zero. This can be expressed as 

rimax 

/5o2^ + E E PndA^J + + 1))" = 1. (2.27) 

1 j—jmin 

For N = 2k jmin = 0, while for iV = 2A: + 1 jmin = 1/2. 

We still have to determine the scaling factor of the individual clones, which is the 
quantity that we want to maximize. This requires calculation of the one-particle 
reduced density matrix, which in our case must have the form 

TrN-iPout^\{l + sa,). (2.28) 
We want to maximize the coefficient of cr^. The terms in pout leading to a are 

"■'max 

aoTvN-iJz + E auTrN-iUf'T. (2.29) 

n=l 

The scaling factor s is obtained by multiplication with and tracing over the 
remaining particle. Using 

IVi (a.IVjv-i J.(J^)") = ^Tr^iJ^fiJ^r, (2.30) 
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iV/2 j 

TMJ.nj'r= E dMj + W E.^'' (2-31) 

J — Jmin ^ — J 

and 

E m' = p{j + l){2j + l) (2.32) 

m=-j 

one obtains 

2 "max ■'V/2 

s = ao2^-^ + ^ «n _E djj{j + l){2j + + l)r. (2.33) 

^ 1 j — jmin 

By the structure of Eqs. (2. 24), (2. 27), and (2.33) one is led to make the substitution 

n' 

max 

% = E + 1))" j = imm, N/2 

n=l 

bj = E MU + 1))" j = irnrn, -, (2.34) 

n=l 

Note that Oq and b^, which arise for even because jmin = 0, are identically zero. 
Note also that a priori this does not seem to be a good change of variables for the 
optimization because in general the aj and bj are not all linearly independent, as 
one can see by counting their number and comparing to the number of and 
We will discuss this problem in detail below when we present the real change of 
variables made, the above substitution is only an intermediate step. 

In these variables the optimization problem has the following form: 

+ bj + (do + aj)m > Vj, m 

N/2 

/3o2^ + _E bjdj{2j + 1) = 1 (2.35) 

3 — Jfnin 

N/2 

ao2^-^ + _E ajdjj{3 + l)(2j + 1) = Max.! 

3 — Jmin 

Noting that 

N/2 

j: ci,(2J + l) = 2^ (2.36) 

J — Jmin 

because it is the number of dimensions of all irreducible representations, and that 

N/2 

Y: d,{2j + + 1) = Tr(J^) = 37V2^-^ (2.37) 

J—Jmin 
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as can be checked by direct calculation of the trace, one is led to make the further 
redefinition 

Aj = {ao + aj)dj{2j + 1) j = N/2 

Bj = iPo + bj)dj{2j + 1) J = jrnin, N/2. (2.38) 

In terms of the independent variables and /3„ this reads 

ao + E «n(j(i + 1))" I j = imm, 

= d,(2j + 1) + + 1))") j = imm, N/2 (2.39) 

Now we have to face the question whether this is a legal change of variables, i.e. 
whether the Aj and Bj are linearly independent. Let's first discuss the second line 
of (2.39). There are rimax + 1 independent parameters on the right hand side, which 
is A; + 1 for A?" = 2A; and also A; + 1 for A?" = 2A; + 1 (see above). This is identical 
to the number of different possible values of j. This means that the number of 
Bj is the same as the number of the only question left is whether the matrix 
connecting the two sets of variables is invertible. This last point is easy to show. 
(It's determinant is a Van der Monde determinant.) 

Turning to the first line of (2.39) we see that the number of independent parameters 
on the right hand side is n^^^. + 1, which is k ior N — 2k and A; + 1 for = 2A; + 1. 
This means that there seems to be a problem for the case N — 2k, because one of 
the Aj is a linear combination of the others. Fortunately it turns out, as we will see 
below, that the variable Aq docs not play any role in the optimization, which allows 
us to disregard it. The other Aj for j ^ can be shown to be linearly independent 
exactly as the Bj . 

Having justified our change of variables, we can now study its consequences. It leads 
to the following set of conditions: 

A(j, m) = Bj + Ajfu > Vj, m 

N/2 

_E Bj^l (2.40) 

J — Jmin 

N/2 

M ,j: Ajj{j + l)^Max.\ 

J — Jmin 

From the first and third line one sees that Aq only enters multiplied by zero and 
therefore doesn't play any role, as mentioned above. From the first line it follows 
that the Bj have to be positive, and also that j\Aj\ < Bj. From the third line it is 
clear that negative values of Aj are not helpful, therefore one obtains 

jAj<Bj Vj. (2.41) 
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If all these inequalities are saturated, one gets 

2 

« = ^._E B,{j + l). (2.42) 

3 — Jmin 

Prom the above it is clear that the maximum is obtained if the value of Bj for the 
largest possible j, i.e. for j — N/2, is equal to unity, with all other Bj equal to zero. 
This leads to 

_ 1 2 

Smax — 2 + 37V' (2.43) 
which is exactly the maximum possible scaling factor in quantum mechanics. 



2.6 Conclusions 



We find it quite remarkable that what we referred to as quantum kinematics, i.e. 
the Hilbert space structure and the projection postulate, together with the no- 
signaling condition already constrains the dynamics to be of the form that we know 
from quantum mechanics: linear and completely positive. Concerning the challenge 
to truly derive quantum mechanics from some fundamental principles, the present 
result is certainly just a small piece of the puzzle. However, besides providing some 
insight into the interrelations between different properties of the standard theory, 
this result also leads to a clear statement about possible non-linear modifications of 
quantum mechanics, namely that they have to give up at least one of the assumptions 
made in our derivation. Although the author has some sympathy for the program of 
studying non-linear extensions, at present he is not sure which assumption he would 
be most willing to give up. 
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3.1 Introduction 

Most predictions of quantum mechanics are of a statistical nature, with the theory 
making probabihstic predictions for individual events. The question whether one can 
go beyond quantum mechanics in this respect, i.e. whether there could be hidden 
variables determining the results of all individual measurements, has been answered 
to the negative for local hidden variables by Bell's theorem [7]. Locality means 
that in such theories the results of measurements in a certain space-time region are 
independent of what happens in a space-time region that is space-like separated, in 
particular independent of the settings of a distant measuring apparatus. 

Bell's theorem refers to a situation where there are two particles and where the 
predictions of quantum mechanics are statistical. Furthermore, even definite (non- 
statistical) predictions of quantum mechanics are in conflict with a local realistic 
picture for systems of three particles or more [47, 46]. 

The Kochen-Specker (KS) theorem [85, 58, 8, 75] states that non-contextual hid- 
den variable theories are incompatible with quantum mechanics. Non-contextuality 
(NC) means that the value for an observable predicted by such a theory does not 
depend on the experimental context, i.e. which other co-measurable observables are 
measured simultaneously. 

To put the Kochen-Specker (KS) theorem in a proper context, let us briefly recall 
some basic facts about measurements in quantum mechanics. Let us first discuss 
sequential measurements. Consider the sequential measurement of two commuting 
observables A and B. Ideally the same values for A and B are found again and again 
for repeated measurements, provided that they are projective. A measurement of 
B does not seem to disturb the value of A. The quantum mechanical explanation 
for this phenomenon is that the first measurement of A and the first measurement 
of B project the system into a joint eigenstate or eigenspace of A and B in which it 
remains. 

For sequential measurements of non-commuting observables the situation is radi- 
cally different. For example, consider repeated measurements of and a^- When 
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measuring the sequence (T^, a^^ (Jz one may find equal to —1 in the third mea- 
surement after having found equal to +1 in the first one. This can be visualized 
with polarizers and photons. This shows that if there are hidden values for quantum 
mechanical observables they must necessarily be affected by the measurement. But 
originally, before a measurement was performed, there might still have been one pre- 
cise value for every observable, which was then influenced by the first measurement 
performed on the system. 

Let us now consider joint measurements of several observables. In quantum mechan- 
ics only joint measurements of commuting observables are meaningful, because only 
these have joint eigenstates onto which the measurement can project. One can ask 
the following question: Is it possible that for all observables there are hidden values 
that do no depend on which other observables are measured jointly? For sequential 
measurements we have already seen that measurements of non-commuting observ- 
ables have an effect. But here the situation is more subtle. We are talking about a 
situation where some observable A could be measured jointly with B or with C and 
we ask whether there can be an underlying theory such that the value for A does 
not depend on whether B or C are measured jointly, and such that this holds for 
all A, B and C. The KS theorem states that there can be no hidden values of this 
kind: measurements of commuting observables also matter. 

The KS theorem was an important station on the road leading to Bell's theorem. 
While one can argue that there is no very good justification for expecting non- 
contextuality, if A, B and C are all measured on a single particle, as in the original 
formulation of the KS theorem, this changes dramatically, if A on the one hand and 
B and C on the other hand can be measured on two particles in entirely difi'crcnt 
locations. One can say that Bell's discovery was that the KS result remains true in 
such a situation as well. For the hypothetical hidden values measurements of com- 
muting observables matter, even if they commute because of space-like separation. 

Let us briefly recall the setting of the original KS theorem. KS considered a single 

spin-one particle, the relevant observables arc the squares of the spin components 
along arbitrary directions, denoted e.g. by S^. for the direction x. These observables 
commute for orthogonal directions. They satisfy the constraint 

Sl + Sl + Sl^s{s + l)^2 (3.1) 

for all orthogonal triplets of directions {x^y^z}. 

The question of non-contextuality now poses itself in the following way: is it possible 
to assign values and 1 to all directions such that the constraint (3.1) is fulfilled? 
Thus the question of the existence of non-contextual hidden values becomes a col- 
oring problem on the sphere. The non-existence of such a coloring can be inferred 
from Gleason's theorem [45] . Kochen and Specker gave a direct proof by exhibiting 
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a finite set of directions (originally 117) that cannot be colored. Since then proofs 
that require only smaller numbers of directions have been found, see e.g. Ref. [74] . 

It is well known that Bell's theorem leads to possible experimental tests of local 
hidden variables by studying the violation of certain inequalities for correlation 
functions. While tests of local hidden variables can also be seen as tests of non- 
contextuality, as briefly explained above, so far there has not been an experiment 
based on the original form of the Kochen-Specker theorem. This would require 
testing that the constraint Eq. (3.1) is indeed fulfilled for all directions belonging to 
the Kochen-Specker set. But the message of the KS theorem is weaker than that of 
Bell's theorem: non-contextual hidden variables are a smaller class than local hidden 
variables. It should therefore be possible to find an experiment disproving non- 
contextuahty that is considerably simpler than the usual tests of Bell's inequalities. 
Furthermore, from the theoretical point of view, the KS argument is quite elaborate. 
It should be possible to reach the same conclusions in a much simpler way. Note 
that the GHZ argument can already be seen as a much simplified KS theorem. 
In the present chapter we show that the above programme can be realized. We 
present a simple argument against non-contextual theories which involves just a few 
observables and leads to a simple experiment. The present work was inspired by the 
work of Cabello and Garcia- Alcaine (GG) [24] . 

The experiment can be realized with single particles, using their path and spin 
degrees of freedom. It leads to a non-statistical test of non-contextuality versus 
quantum mechanics. In this respect it is similar to the GHZ argument against local 
realism. 

If the experiment is realized with photons, the setup that we shall present only re- 
quires a source for single photons (such as parametric down-conversion) and passive 
optical elements. In the following, we first show how a very direct experimental test 
of non-contextuality can be found, then we discuss our operational realization. 

3.2 A Simple Kochen-Specker Argument . . . 

Consider four binary observables Zi,Xi, Z2, and X2. Let us denote the two possible 
results for each observable by ±1. In a non-contextual hidden variable (NGHV) 
theory these observables have predetermined non-contextual values +1 or —1 for 
individual systems, denoted as v{Zi),v{Z2),v{Xi), and v{X2). This means e.g. 
that for an individual system the result of a measurement of Zi will always be v{Zi) 
irrespective of which other compatible observables are measured simultaneously. 

Now imagine an ensemble E of systems for which one always finds equal results for 
Zi and Z2, and also for Xi and X2. (Glearly, in order for this statement to be 
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meaningful, Z-^ and Z2, and and X-i have to be co-measurable.) In the NCHV 
theory this means that 

v{Zr) = v{Z2) and v{Xi) = v{X2) (3.2) 

for each individual system of the ensemble. Then there are only two possibilities: 
either v{Zi) = v{X2), which implies v{Xi) = v{Z2); or v{Zi) ^ v{X2)i which implies 
v{Xi) ^ v{Z2). We will see that this elementary logical deduction is already suffi- 
cient to establish a contradiction between NCHV theories and quantum mechanics. 

To this end, let us express the above argument in a slightly different way. Eq. (3.2) 
can be written as 

v{Z^)v{Z2) = v{XMX2) = 1. (3.3) 
Multiplying by v{X2)v{Z2) it immediately follows that 

v{Z^)v{X2)=v{X,)v{Z2). (3.4) 

Let us now introduce the notion of product observablcs such as Z1X2. By defini- 
tion, one way of measuring Z1X2 is to measure Zi and X2 separately and multiply 
the results; in general, there are other ways. In particular, if another compatible 
observable (e.g. X1Z2, cf. below) is measured simultaneously, it will in general not 
be possible to obtain separate values for Zi and X2. However, in a non-contextual 
theory, the result of a measurement of an observable must not depend on which 
other observables are measured simultaneously. Therefore the predetermined value 
v{ZiX2), for example, in a NCHV theory has to follow the rule [24] 

viZ^X2)^viZMX2). (3.5) 

In this new language, our above argumentation can be resumed in the following way: 

v{ZiZ2) = v{XiX2) = 1 ^ v{ZiX2) = v{XiZ2) (3.6) 

i.e. if our systems have the property expressed in Eq. (3.2), then the two product 
observables Z1X2 and X1Z2 must always be equal in a NCHV theory. Note that 
in general this prediction of NCHV can only be tested if Z1X2 and X1Z2 are co- 
measurable. 

It follows from the results of [24] that the prediction (3.6) leads to an observable 
contradiction with quantum mechanics. To see this, consider a system of two qubits 
and the observables [24] 

Z, := a(^\X, := a(^\Z2 := , := , (3.7) 

where a^^^ means the z-component of the "spin" of the first qubit etc. It is easy 
to check that this set of observables satisfies all the properties required above. In 
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particular, while and Xi, and Zi and X2, do not commute, the two product 
observables Z^X^ and X^Z^ do. Furthermore, the quantum-mechanical two-qubit 
state 

iV'i) = -^(1+^)1+^) + 1-^)1-^)) 

= ^(1+^)1+^) + 1-^)1-^)) (3.8) 

is a joint eigenstate of the commuting product observables Z^Z^ and X^X^ with both 
eigenvalues equal to +1. Therefore, on the one hand the ensemble described by this 
state possesses the property of the ensemble E discussed above (cf. (3.2)): the 
measured values of Z\Z2 and X1X2 arc equal to +1 for every individual system. On 
the other hand, quantum mechanics predicts for the state I'f/'i), that the measured 
value of ^'1X2 will always be opposite to the value of X^Z^- This can be seen by 
decomposing j-^i) in the basis of the joint eigenstates of the two commuting product 
observables Z-^X^ and X\Z2. 

iV^i) = ^(IXi,-i) + IX-m))> (3-9) 

with 



|Xi,-i) = 

^+z)\-z)-\-z)\+z)) 

= ^{\^^)\'rx)-\-z)\-x)) 
1 



72 



(|-x)|+^) + |+x)|-^)) 



-1 + ^)1-^) + 1-^)1 + ^)) 

= ^(l + ^)|-^) + |-^)l+^)) 
1 



(| + x)| + ^) 



x)\-z)). 



(3.10) 



and 



Z\X2\x\-\) = +|xi-i),^i^2|xi-i) = -|xi-i) 
^1^2|X-l,l) = -|x-i,i),^i^2|x-i,i) = +lx-i,i) 



(3.11) 
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Prom (3.9) and (3.11) one sees that l^'i) is a linear combination of exactly those 
joint eigenstates of Z1X2 and X1Z2 for which the respective eigenvalues are opposite, 
which means, of course, that in a joint measurement the two observables will always 
be found to be different. With Eq. (3.6) in mind, this implies that the ensemble 
described by cannot be described by any non-contextual hidden variable theory. 

Note that one would already have a contradiction if quantum mechanics only pre- 
dicted that the observed values of .^'1X2 and X1Z2 are sometimes different, but in 
fact the result is even stronger, with QM and NCHV predicting exactly opposite 
results. 

According to the argument presented in the previous paragraph, an experimental 
test of non-contextuality can be performed in the following way: (i) Show that 
Z1Z2 = 1 and X1X2 = 1 for systems prepared in a certain way. (ii) Determine 
whether Z1X2 and X1Z2 are equal for such systems. Note that in steps (i) and (ii) 
the observables Zi,Xi,Z2, and X2 appear in two different contexts. 

Quantum mechanics predicts that step (i) can be accomplished by constructing a 
source of systems described by the state \ipi) and measuring Z1Z2 and X1X2 on 
these systems. According to QM, both Z1Z2 and X1X2 will always be found to 
be equal to +1. This can e.g. be verified by measuring the pairs Zi and Z2 and 
Xi and X2 separately on many systems, and obtaining the values of Z1Z2 and 
X1X2 by multiplication. Alternatively, one could also perform joint measurements 
of Z1Z2 and X1X2 on individual systems, but for step (i) such joint measurements 
are not strictly necessary. On the other hand, step (ii) definitely requires a joint 
measurement of Z1X2 and X1Z2, because both negative and positive values are to 
be expected for Z1X2 and X1Z2, and we have to determine whether their values are 
equal or opposite for individual systems. 

3.3 ... Leading to a Possible Experiment 

One could consider realizing the above protocol with two particles. However, since a 
joint measurement of the two qubits is required it follows that locality is not an issue 
in the present experiment. This suggests looking for a single-particle realization for 
the sake of simplicity. 

In our single-particle scheme, the first qubit is emulated by the spatial modes of 
propagation (paths) of a single spin-1/2 particle or photon, and the second qubit 
by its spin (or polarization) degree of freedom [97, 33]. Spin-1/2 and photon po- 
larization are completely equivalent for our purposes. Our setup requires a source 
of polarized single particles, beam splitters, and Stern-Gerlach type devices. In 
practice, the experiment would be easiest to do with photons because all these 
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elements are readily available, in particular polarized single-photon states can be 
produced to excellent approximation via parametric down-conversion [39]. Besides 
their conceptual simplicity, single-photon experiments are attractive because very 
pure experimental conditions, in particular very high visibilities, can be achieved. 
Nevertheless, we will use the spin language in the sequel because it is more familiar 
to most physicists. 

Consider a situation where the particle can propagate in two spatial modes u and 
d, and let \z+), \z—) etc. denote the particle's spin states as before. Then the state 
iV'i) of Eq. (3.8) is mapped onto the one-particle state 

l={\u)\z+) + \d)\z-)). (3.12) 

The observables Zi, Xi, Z2, X2 are now represented by 

u){u\ — \d){d\ 
u'){u\ — \d'){d'\ 
z+){z+ I - \z-){z- I 

x+){x + I — \x—){x — I, (3.13) 

where \u') = j=^i\u) + \d)),\d') = ^(|«) - \d)).^x+) = ^(|z+) + \z-)),\x-) = 
■^{\z+) — \z—)), i.e. u' and d' denote the output modes of a 50-50 beam-splitter 
with inputs u and d, and \x+) and \x—) are the spin eigenstates along the x direction 
. Clearly, Zi and Xi act on the path, and Z2 and X2 on the spin degree of freedom. 

To illustrate the physical meaning of the states and observables in our scheme, we 
show in Fig. 3.1 how a state such as l^ipi) in the form of Eq. (3.12) can be prepared, 
and in Fig. 3.2 we show the devices that measure pairs of one-particle observables, 
such as Zi and Z2. 

The devices of Figs. 3.1 and 3.2 enable us to reahze step (i) of the protocol described 
above. As for step (ii). Fig. 3.3 shows how a device performing a joint measurement 
of Z1X2 and X1Z2 can be built out of the building blocks of Fig. 3.2. 

Instead of leading to detectors, the outputs of the device of Fig. 3.2b, which measures 
Zi and X2, are now connected to two replicas of the device of Fig. 3.2c, which 
measure Xi and Z2. That the device indeed performs a joint measurement of Z1X2 
and X1Z2 can be demonstrated by analyzing how it acts on the joint eigenstates of 
these two observables. 

Comparison with Fig. 3.2 shows that the first device separates the two eigenspaccs 
of the degenerate product observable Z1X2. Eigenstates of ^'1X2 with eigenvalue 
+1 are sent up, those with eigenvalue -1 are sent down. It is important to note that 
this is the only way in which eigenstates of Z1X2 are affected by the first device. 



Xi 

Z2 
X2 
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i.e. they have exactly the same form in terms of the two spatial modes leading to 
the respective subsequent XiZ'2-measuring device as they had in terms of the modes 
entering the first device. One could say that the first device "almost" performs an 
ideal Von Neumann measurement of the observable ^1X2. The difference to a Von 
Neumann measurement lies in the fact that the superposition between states with 
Z1X2 — 1 and Z-1X2 — —1 is not destroyed by the device but only made ineffective 
because the respective components of any incoming state enter completely separated 
subsequent devices. Detection of the particle behind one of those two subsequent 
devices is a Von Neumann measurement of X1Z2 and at the same time completes 
the measurement of ^1X2. As is evident from the structure of the device of Fig. 
3.3, the measurement of X1Z2 is performed by measuring Xi and Z2 separately as 
in Fig. 3.2c and multiplying the values. 

While any device that performs a state analysis in the basis of common eigenstates 
of ^1X2 and X1Z2 can be considered to perform a joint measurement of these two 
observables, the particular realization presented here has the merit of showing ex- 
plicitly that a joint measurement of two product observables is performed, and how 
the information that could have been obtained in the first stage of the measurement 
(the values of Zi and X2 separately) has to be partially erased in order to make the 
second stage possible. 

Let us now consider what happens when a particle in the state enters the device 
of Fig. 3.3. Recall from Eq. (3.9) that is an equally-weighted superposition 
of two states with opposite eigenvalues of ^1X2. Therefore the particle has equal 
amplitudes for entering cither of the two X1Z2 devices. Explicit calculation confirms 
that the particle can emerge only via one of those four outputs for which the values 
of ^1X2 and X1Z2 are opposite. As explained above, after it has been shown that 
Z1Z2 — X1X2 = 1 for our particle source. NCHV predict exactly the complementary 
set of outputs (for which .^'1X2 and X1Z2 are equal). Therefore the two theories 
give clearly confiicting predictions for observable effects on a non-statistical level. 
Of course, in a real experiment visibilities are never perfect, and one would have to 
use some kind of inequality to rigorously estabhsh the contradiction, (cf. [47]). 

The present scheme allows the simplest non-statistical experimental test of non- 
contextuality that is known to us. For a single-photon experiment that implements 
a statistical test of NCHV versus QM see [66]. Similarly to the original Kochen- 
Specker paradox it requires only a single particle (though two degrees of freedom). 
With the experimental setup consisting of a simple interferometer, it shows partic- 
ularly clearly that the appearance of the paradox is related to the superposition 
principle. 
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Figure 3.1: Possible way of creating the single-particle version of given in Eq. 

(3.12) using a standard Stern-Gerlach apparatus. A single particle with 
spin state \x+) = + spin along the positive x direc- 

tion, comes in from the left (spatial mode |a)). By the Stern-Gerlach 
device, which separates incoming states according to the z-components 
of their spin, this is transformed into the desired superposition state. 
The outputs u and d could be connected to the inputs of the devices of 
Figures 3.2 or 3.3 
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(Xi=+l,Z2=+l) 
(Xj =+1, Z2= -1) 
(Xj = -1, Z2= +1) 
(Xi=-l,Z2= -1) 

.(Xi=+l,X2=+l) 
(Xj=+l,X2= -1) 

■(Xi = -l,X2=+l) 
(Xi=-l,X2= -1) 



d) 



Figure 3.2: Devices for measuring pairs of the single-particle observables of Eq. 

(3.13). A particle comes in from the left. Note that in general the 
incoming states will have components in both spatial modes u and d 
and of different spin. The devices shown measure: a) Zi and Z2; b) Zi 
and X2; c) Xi and Z2; d) Xi and X2. BS in c) and d) stands for a 
50 — 50 beam-splitter (see main text), which changes the basis of path 
analysis from \u), \d), corresponding to a measurement of Zi, to \u'), \d'), 
thus leading to a measurement of Xi. In a) and c) the Stern-Gerlach 
apparatus are oriented along the 2;-axis (measurement of Z2), in b) and 
d) along the x-axis (measurement of X2). 
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ZlX2=^-l,XlZ2=^-l) 

ZiX2=fl,XiZ2 = -l) 
ZiX2=fl,XiZ2=fl) 
ZiX2=fl,XiZ2 = -l) 

ZlX2=-l,XlZ2=^-l) 

ZiX2=-l,XiZ2 = -l) 

ZlX2=-l,XlZ2=^-l) 
ZiX2=-l,XiZ2 = -l) 



Figure 3.3: Device for performing a joint measurement of ^1X2 and X1Z2. A device 
performing a joint measurement of Z1Z2 and X1X2 can be constructed 
in an analogous way. 
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4 Hidden— Variable Theorems for Real 
Experiments 



4.1 Introduction 

In the original derivation of hidden-variable theorems, such as the Bell theorem, 
certain idealizations were made. For example the detection efficiency was originally 
assumed to be perfect. The case of non-unit efficiency has since been treated in de- 
tail [28, 27]. As another idealization, the precision of the measurements performed 
is usually not considered. When considering experimental tests of the corresponding 
classes of hidden-variable theories, this appears to be an important point for the 
following reason. An essential feature of all the hidden-variable theorems is that 
observables have to appear in different experimental contexts in order for a contra- 
diction to be obtained (i.e. observables have to be measured simultaneously with 
different mutually exclusive observables). 

For example, as we have seen above, the Kochen-Spcckcr theorem concerns trying 
to assign values to all directions on the sphere subject to a constraint for triads 
of orthogonal directions. One can only arrive at a contradiction by considering 
several triads that have at least some directions in common. For Kochen-Specker 
experiments this implies that the observables corresponding to individual directions 
(i.e. the squares of the spin components along these directions) have to appear in 
different triads. 

At first sight the usual derivations of hidden- variable theorems seem to run into prob- 
lems when the finite precision of real experiments is taken into account, because then 
it seems impossible to ascertain that the same observable is really measured more 
than once in different experimental contexts. This question seems to be of particular 
relevance for the Kochen-Specker theorem in view of recent claims by Meyer that 
this theorem is "nullified" when the measurements have only finite precision [65]. 

This claim was based on the fact that it is possible to assign values to all rational 
directions of the sphere, which constitute a dense subset of all directions. This con- 
struction was generalized by Kent [56]. Meyer argued that, since by mcasTircments 
with finite precision one cannot discriminate a dense subset from its closure, this 
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implies that non-contextual hidden variables cannot be excluded by any real exper- 
iment of the Kochen-Specker type. However, Meyer did not construct an explicit 
non-contextual hidden-variable model for real experiments with finite precision. 

In the following we show how these questions can be resolved by providing a general 
method for the derivation of hidden-variable theorems for real experiments. In 
order to achieve this the concept of observable has to be changed in such a way 
that it has an operational meaning. For concreteness, imagine that an observer 
wants to perform a measurement of the spin square along a certain direction n. 
There will be a certain experimental procedure for trying to do this as accurately as 
possible. We will refer to this procedure by saying that he sets the " control switch" 
of his apparatus to the position n. In all experiments that we will discuss only a 
finite number of different switch positions is required. By definition different switch 
positions are clearly distinguishable for the observer, and the switch position is all he 
knows about. Therefore, in an operational sense the measured physical observable is 
entirely defined by the switch position. From the above definition it is clear that the 
same switch position can be chosen again and again in the course of an experiment. 

In general one has to allow for the possibility that the switch position n does not 
uniquely determine the physical state of the measuring apparatus, i.e. there may 
be (hidden) properties of the apparatus over which the observer does not have full 
control but which may infiuence the result of any given measurement. Following the 
philosophy of deterministic hidden variable theories, one therefore has to assume 
that the result of any measurement will be determined not only by the hidden 
properties of the system, but also by those of the measuring apparatus. 

In the present paper we do not discuss stochastic hidden variable theories explicitly. 
This does not limit the generality of the results derived because the existence of a 
stochastic hidden variable model for a given physical system implies that also an 
underlying deterministic model can be constructed which reproduces the probabil- 
ities of the stochastic model. Therefore e.g. ruhng out all possible non-contextual 
deterministic hidden-variable models implies ruling out all possible non-contextual 
stochastic models as well. 

4.2 Kochen— Specker Theorem for Real 
Experiments 

As a concrete application of the ideas expressed in the two preceding paragraphs, 
we are now going to show how non-contextual hidden variables can be excluded by 
real experiments. Let us note that local hidden variables can be ruled out using an 
equivalent approach. 
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In the original Kochen-Specker situation one considers a spin-1 particle. In the 
ideal case of perfect precision, the relevant observables are the squares of the spin 
components, denoted by S"- for arbitrary directions n. For a spin-1 particle one has 

Si + Si + Si = 2 (4.1) 

for every triad of orthogonal directions {ni, n2, '^ia}- As the possible results for every 
Sl are or 1, this implies that in the ideal case for every measurement of three 
orthogonal spin squares two of the results will be equal to one, and one of them will 
be equal to zero. 

Let us emphasize that in our approach the operational observables are defined by 
the switch positions (i.e. by the best effort and knowledge of the experimenter) and 
therefore are not exactly identical to the exact quantum mechanical observables. 
In the following the symbol S'i will denote the operational observable defined by 
the switch position n, and the term direction will be used as a synonym for switch 
position. 

In a deterministic hidden variable theory one assumes that for every individual par- 
ticle the result of the measurement of any observable S'i is predetermined by hidden 

properties. In non-contextna/ hidden variable theories it is furthermore assumed that 
this predetermined result does not depend on the "context" of the measurement, in 
particular which other observables are measured simultaneously with 5*1, but only 
on the switch position n and the hidden variables. 

In the ideal case one could define non-contextuality in such a way that the prede- 
termined value of some quantum mechanical observable X is required to be inde- 
pendent of the simultaneously measured observables only if they exactly commute 
with X . Note that only in the ideal case the observables corresponding to precise 
directions would have an operational meaning. It is evident that this weaker form 
of non-contextuality can only be tested in the idealized case of infinite precision. 

In general the result may depend both on the hidden properties of the system and 
of the apparatus. Let us denote the hidden variables of the system by A and those of 
the apparatus by ji. For further use, let us denote the ensemble of all possible pairs 
(A, //) by A. As explained above, the philosophy of non-contextual hidden variables 
implies the existence of a function S'?(A, /i) taking values and 1 which describes the 
result of a measurement with switch position n on a system characterized by A with 
an apparatus characterized by ii. For fixed A and fi this function therefore assigns 
a value or 1 to the switch position n. Let us note that the models discussed by 
Clifton and Kent [30] are not non-contextual in the present sense because in these 
models the result of a measurement of 5*1 in general does not only depend on A, ji, 
and n, but also on the other observables measured simultaneously. 

A Kochen-Specker experiment can now be performed by testing the validity of Eq. 
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(4.1) for a judiciously chosen set of triads of directions. Therefore the apparatus 
is required to have three switches where the three directions of a given triad can 
be chosen. Because the switch positions do not correspond to the ideal quantum 
mechanical observables the sum of the three results will not always be equal to 2. 
Nevertheless a contradiction between non-contextuality and quantum mechanics can 
be obtained in the following way. 

Prom the Kochen-Specker theorem it follows that there are finite sets of triads for 
which no value assignment consistent with Eq. (4.1) is possible [58, 74]. Let us 
choose such a Kochen-Specker set of triads 

{{ni, n2, na}, {ni, n4, ng}, ...,}. (4.2) 

Let us emphasize that at least some of the switch positions have to appear in 
several of the triads (clearly otherwise there could be no inconsistency). Let us 
denote the number of triads in the Kochen-Specker set (4.2) by N. The set is 
constructed in such a way that if one can show for some fixed A and /i that 

/^) + ^^(A, //) + SliX, iS) = 2 (4.3) 

is valid for — 1 of the triads {rij, n^, n^}, one obtains the prediction that it has to 
be violated for the final triad. 

Suppose that for the first triad {ni, n2, ^3} in the Kochen-Specker set one finds that 
the sum of the results is equal to 2 in a fraction greater than 1 — e of all cases. For 
the hidden variables this implies that 

SU\t^) + Sl{\,i,) + 4^(A,^) = 2 (4.4) 

for all (A,;u) G Ai, where Ai is some subset of the set of all hidden variables A with 
measure p(Ai) > 1 — e (by definition p(A) = 1). Suppose furthermore that one 
establishes in the same way for the second triad {ni, n4, ns} that 

Sl{\lA + Sl{\l,) + Sl{\l,) = 2 (4.5) 

for all (A, /x) G A2 with p{h.2) > 1 — e where in general A2 is a different subset of A, 
and so on for all A'" — 1 triads except the final one. 

This implies that for all (A, n) in the intersection of sets An := AinA2n...nAAr_i the 
sum of results is equal to 2. Consequently, because of the structure of the Kochen- 
Specker set the sum of the results for the final triad has to be different from 2 (i.e. 
0, 1 or 3) for all pairs (A, /x) G Ap. This leads to the experimental prediction that 
the sum of results will be different from 2 for the final triad in a fraction p(An) of 
all cases. From the property of sub-additivity (p(UjAj) < Z^iP(^i)) of the measure 
p it immediately follows that 

p(An) >1-{N- l)e. (4.6) 
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Therefore in order to experimentally disprove non-contextual hidden variables one 
only needs to show that the sum of results is equal to 2 in a fraction of all cases that 
is greater than (A^ — l)e. 

If we assume for simplicity that e is defined such that the fraction of " correct" (equal 
to 2) results is larger than 1 — e for all triads (including the final one) then the above 
results allow us to derive a bound on the size of the experimental imperfection e such 
that an experimental contradiction with non-contextuality can still be obtained: e 
has to be smaller than 1/N. Note that e describes all the imperfections of a real 
experiment including finite precision but also e.g. imperfect state preparation and 
non-unit detection efficiency. The value of and therefore of the bound on e depends 
on the particular Kochen-Spcckcr set used [58, 74]. 

As we have already noted above, an inevitable requirement for the contradiction to 
be obtained is the fact that the function 5'^-(A,/x), or in general functions corre- 
sponding to at least some switch positions, appear in more than one out of the N 
triads. This appearance of the same function in different lines of the mathematical 
proof (corresponding to different experimental contexts) is possible in spite of finite 
experimental precision only because we defined our observables operationally via 
the switch positions. 

We have shown how non-contextual hidden- variable theories can be disproved by real 
experiments. This clarifies questions raised by [65]. In view of our results, we would 
assert that the Kochen-Specker theorem is not "nullified" by finite measurement 
precision. Let us note that independent arguments in favor of this conclusion were 
given in [63, 3, 2, 60]. Our suggestion how to perform a Kochen-Specker experiment 
was inspired by some of Mermin's remarks in [63] . 

Using the same method one can also show that local hidden variables can be dis- 
proved in real experiments, e.g. using the GHZ [47] form of Bell's theorem which 
is also based on sets of propositions that cannot be consistently satisfied by hidden 
variables. Inequalities analogous to Eq. (4.6) can be derived and tested experimen- 
tally [73]. 

4.3 Hidden Variables: Perspectives 

In the previous chapter we have presented a very simple Kochen-Specker type argu- 
ment. It seems unlikely that much further simplification is possible. In this chapter 
we have analyzed the derivation of hidden-variable theorems for real experimental 
conditions. We have seen that the theorems, including those on non-contextuality, 
are robust under real-world conditions and thus experimentally testable. 
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Let us emphasize that 36 years after Bell (and 68 years after von Neumann) there 
are still interesting open questions in the field of hidden variables. Most impor- 
tantly, on the experimental side, a loophole-free demonstration of the violation of 
Bell's inequalities is still missing. Such an experiment would require both space-like 
separation of the measurements performed on each entangled pair and high detec- 
tion efficiency. For the realization of such experiments new ways of establishing 
contradictions between local hidden variables and quantum mechanics may be help- 
ful. E.g. Eberhard [38] showed that the required detection efficiencies are lower 
if non~maximally entangled states together with appropriate analyzer settings are 
used. A possible new approach would be to consider adaptive measurements [78], 
or in general joint measurements on several pairs. 

On the theoretical side, there are many connections between the study of general 
Bell's inequalities and the classification and quantification of entanglement, cf. e.g. 
[80]. A particularly interesting open question is whether the so-called "bound en- 
tangled" states [51], which are states from which no maximal entanglement can be 
distilled, admit local hidden variable models, i.e. whether there is entanglement 
without non-locality. 

One may also hope that a detailed understanding of the quantum weirdness, i.e. the 
differences between quantum mechanics and the classical world view, might help 
to generate ideas how to exploit it in order to perform tasks that are classically 
impossible. 
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During the three years of my PhD studies I had the good fortune to come into 
contact with many fields, some of which are not even mentioned in this thesis. 
Quite naturally I learned about various sub-fields of quantum information, ranging 
from cloning and quantum cryptography, over quantum computing, to the study 
of entanglement and its purification. I got to know quantum optics, another field 
which had not figured in my undergraduate studies, from the experimental and also 
somewhat from the theoretical side. I learned a lot about hidden-variable theorems 
and the related experiments and some basic but important facts about the practical 
aspects of decoherence, mostly from my experimental colleagues. Towards the end 
I was glad to learn some things about entanglement in quantum field theory, a 
topic I liked because it created a bridge to my earlier studies. I also found that my 
expectations had been correct: indeed there were many opportunities to discuss, 
learn and think about the basic questions of quantum physics, together with Anton 
Zeilinger, Caslav Brukner, and many others. 

One of the most important things that I learned is that it is very nice and that it can 
also be quite fruitful for a theorist to be in close (in my case: permanent) contact 
with an experimental group, especially if it is such a good one. Not least because 
your knowledge of physics is constantly tested by the questions of your experimental 
friends, which have a tendency to always be related to the real world. I learned other 
things which I think will be useful, such as writing a proposal, organizing a workshop. 
Once, we even made a movie. All this was usually done in a team of great people, 
from about ten different nations. So much for myself. 

What have we learned from quantum information in general? Most importantly, 
that, with the help of quantum physical systems, one can do things that are un- 
thinkable classically, starting of course with the discovery of Bell's inequalities. In 
the last years we have also learned a lot about what can be done in practice in the 
lab. This includes many amazing things, such as multi-particle entanglement [72], 
the interference of large molecules [4] , the study of single ions in traps coupled via 
single phonons [69], and single photons interacting with single atoms in cavities of 
incredibly high quality [49] . It is probably also fair to say that quantum information 
has led to a new way of looking at physics, for example we see entanglement almost 
everywhere. 
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Let me try to summarize the major challenges for quantum information at the 
present stage by two questions: What else could we do (with quantum systems), 
and what can we really build? The first question has a theoretical, the second an 
experimental flavor, but people from both sides are trying to find answers to both. A 
natural way of attacking the first one is to look for new quantum algorithms that out- 
perform classical ones. Physicists and computer scientists are also investing a lot of 
effort into trying to prove general results on the power of quantum computation. One 
may also feel that "quantum non-locality" has not yet been fully exploited. There 
must be more that one can do with distributed entanglement than violate Bell's 
inequalities and perform cryptography and teleportation. One promising result is 
the reduction of communication complexity [29] . 

As for the second question, the main goals are to build a quantum computer of 
serious size and to achieve quantum communication over long distances. Hope rests 
on the continuous improvement of solid state techniques [55] , for example for semi- 
conductors and super-conductivity, and also in new technological achievements, such 
as Bose- Einstein condensation [1] and the development of laser cooling [64]. The 
main difficulty in designing and building a quantum computer is to keep quantum 
coherence, where normally it is lost very fast. This is attempted using a combination 
of technological approaches such as cooling, isolation, the use of systems which 
have low decoherence by nature, and algorithmic methods, such as quantum error 
correction and fault-tolerant computing. 

Thus, there is a close connection between quantum computing and a more founda- 
tionally oriented research programme, the preparation and study of larger and larger 
superposition states, "Schrodinger cats". One of the driving hopes behind such a 
programme was formulated in the preface to this thesis: something new could turn 
up. We might find the limits of validity of quantum physics, maybe even something 
like the border of the classical world. At present such hopes, although no strangers 
to the author's heart, seem preposterous if not unreasonable. One argument in 
their favor which has some appeal for the author is that usually in physics linearity 
is an approximation. The connection between the linearity of quantum mechanics 
and special relativity should be kept in mind in this context, but it is well known 
that non-linear modifications of quantum physics are conceivable if some of its basic 
assumptions are given up. 

There are other more modest hopes for the future of quantum information. There is 
little doubt that we will learn a lot more about physics in a practical and quantitative 
sense. We also hope for new conceptual insights, for example concerning the relation 
of quantum and classical information. Quantum information should meet other areas 
of physics besides quantum optics, most prominently statistical physics and quantum 
field theory. New interesting physics should come out of such encounters. A more 
ambitious hope again is that the concept of information may help us to arrive at a 
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deeper understanding of the basic principles of quantum physics [16, 17, 94]. 

Personally I hope to use many of the things that I have learned and to continue 
working in a fascinating field dealing with fundamental questions and yet close to 
the real world of experiments. 
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